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Abstract: EMU is a wide-field radio continuum survey planned for the new Aus- 
tralian Square Kilometre Array Pathfinder (ASKAP) telescope. The primary goal 
of EMU is to make a deep (rms ~ 10/iJy/beam) radio continuum survey of the 
entire Southern Sky at 1.3 GHz, extending as far North as +30° declination, with a 
resolution of 10 arcsec. EMU is expected to detect and catalogue about 70 million 
galaxies, including typical star-forming galaxies up to z~l, powerful starbursts to 
even greater redshifts, and AGNs to the edge of the visible Universe. It will un- 
doubtedly discover new classes of object. This paper defines the science goals and 
parameters of the survey, and describes the development of techniques necessary 
to maximise the science return from EMU. 
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1 Introduction 

1.1 Background 

Deep continuum surveys of the radio sky have a 
distinguished history both for discovering new classes 
of object and for providing radio counterparts to 
astronomical objects studied at other wavelengths. 
The earliest large surveys, such as the 3C cata- 



logue (Edge et al. 1 1959 ) and the Molonglo Ref- 



erence Catalogue ( Large et al.||1981 ), gave us the 



first insight into the physics of radio galaxies and 
radio-loud quasars, but were insufficiently sensi- 
tive to detect any but the nearest radio-quiet or 
star-forming galaxies. Later radio surveys reached 
flux densities where normal star-forming galaxies 
were detected, but were still largely dominated 
by radio- loud active galactic nuclei (AGN). Only 
very long integrations in narrow deep fields made 
it possible to start probing star-forming galaxies 
beyond the local Universe. This paper describes 
a planned survey, EMU (Evolutionary Map of the 
Universe), which will reach a similar sensitivity (~ 
10 /iJy/beam) to those deep surveys, but over the 



entire visible sky. At that sensitivity, EMU will be 
able to trace the evolution of galaxies over most of 
the lifetime of the Universe. 

Fig. 1 shows the major 20-cm continuum ra- 
dio surveys. The largest existing radio survey, 
shown in the top right, is the wide but shallow 
NRAO VLA Sky Survey (NVSS), whose release 
paper ( Condon et al. 1998[ ) is one of the most cited 
papers in astronomy. The most sensitive existing 
radio survey i s the deep but narrow L ockman Hole 
observation ( Owen fc Morison |2008[ ) in the lower 
left. All current surveys are bounded by a diagonal 
line that roughly marks the limit of available tele- 
scope time of current-generation radio telescopes. 
The region to the left of this line is currently unex- 
plored, and this area of observational phase space 
presumably contains as many potential new dis- 
coveries as the region to the right. 

The Square Kilometre Array (SKA) is a pro- 
posed major internationally-funded radio telescope 



(Dewdney et al. 20091 whose construction is ex- 
pected to be completed in 2022. It will be many 
times more sensitive than any existing radio tele- 
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Figure 1 Comparison of EMU with existing deep 
20 cm radio surveys. Horizontal axis is 5-cr sen- 
sitivity, and vertical axis shows the sky coverage. 
The diagonal dashed line shows the approximate 
envelope of existing surveys, which is largely de- 
termined by the availability of telescope time. The 
squares in the top-left represent the EMU survey, 
discussed i n this paper, and the c omplementary 
WODAN (|Rottgering et al.||2010b|) survey which 



has been proposed for the upgraded Westerbork 
telescope to cover the sky North of +30° . Surveys 
represented by diagonal lines are those which range 
from a wide shallow area to a smaller deep area. 
The horizontal line for ATLAS extends in sensitiv- 
ity from the intermediate published data releases 



([Norris et al.||2006[ |Middelberg et al.|2008a||Hales 
et al. 2011) to the final data release ( jBanfield et 
al. ||2011 1 ). 



scope, and will answer fundamental questio ns about 
the Universe ( |Carilli fe Rawlings 2004). It is 
likely to consist of between 1000 and 1500 15-meter 
dishes in a central area of diameter 5 km, sur- 
rounded by an equal number of dishes in a region 
stretching up to thousands of km. 

The Australian SKA Pathfinder (ASKAP) is a 
new radio telescope being built both to test and 
develop aspects of potential SKA technology, and 
to develop SKA science. ASKAP is being built 
on the Australian candidate SKA site in Western 
Australia, at the Murchison Radio-astronomy Ob- 
servatory, with a planned completion date of late 
2012. In addition to developing SKA science and 
technology, ASKAP is a major telescope in its own 
right, likely to generate significant new astronom- 



ical discoveries. 



1.2 ASKAP 



ASKA P \ Johnston et al.|2007[|2008)|Deboer et al. 
2009) will consist of 36 12-metre antennas spread 
over a region 6 km in diameter. Although the 
array of antennas is no larger than many exist- 
ing radio telescopes, the feed array at the focus 
of each antenna is revolutionary, with a phased- 
array feed (PAF: |Bunton fc Hay ||2010[ ) of 96 
dual-polarisation pixels, designed to work in a fre- 
quency band of 700-1800 MHz, with an instanta- 
neous bandwidth of 300 MHz. This will replace 
the single-pixel feeds that are almost universal in 
current-generation synthesis radio telescopes. As 
a result, ASKAP will have a field of view up to 
30 deg 2 enabling it to survey the sky up to thirty 
times faster than existing synthesis arrays, and al- 
lowing surveys of a scope that cannot be contem- 
plated with current-generation telescopes. To en- 
sure good calibration, the antennas are a novel 3- 
axis design, with the feed and reflector rotating to 
mimic the effect of an equatorial mount, ensuring 
a constant position angle of the PAF and sidelobes 
on the sky. The pointing accuracy of each antenna 
is significantly better than 30 arcsec. 



The ASKAP array configuration ( Gupta et al. 



2008) balances the need for high sensitivity to ex 



tended structures (particularly for neutral hydro- 
gen surveys) with the need for high resolution for 
continuum projects such as EMU. To achieve this, 
30 antennas follow a roughly Gaussian distribu- 
tion with a scale of ~ 700 m, corresponding to a 
point spread function of ~ 30 arcsec using natural 
weighting, with a further six antennas extending 
to a maximum baseline of 6 km, corresponding 
to a point spread function of ~ 10 arcsec using 
uniform weighting. The positions of the anten- 
nas are optimised for uv coverage (i.e. coverage in 
the Fourier plane) between declination —50° and 
+10°, but give excellent uv coverage between dec- 
lination -90° and +30°. 

The PAF is still under development, but the 
performance of prototypes gives us confidence that 
the EMU survey is feasible as planned. The PAF 
will consist of 96 dual-polarisation receivers, each 
with a system temperature ~ 50 K, which are com- 
bined in a beam- former to form up to 36 beams. 
Each of these beams has the same primary beam 
response as a single-pixel feed (~ 1.2° full- width 
half- maximum at 1.4 GHz), distributed in a uni- 
form grid across an envelope of 30 deg 2 . The op- 
timum weighting and number of beams is still be- 
ing studied, but the current expectation is that 36 
beams will be used for EMU, with the sensitivity 
over the 30 deg 2 FOV (field of view) expected to 
be uniform to ~ 20%. This will be improved to 
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<10% uniformity by dithering, with no significant 
loss of sensitivity, so that the images from the 36 
beams can be jointly imaged and deconvolved as 
a single image covering the FOV. Consequently, it 
is expected that the telescope will dwell on one 
position in the sky for 12 hours, reaching an rms 
sensitivity of ~ 10 /iJy/beam over a ~ 30deg 2 
FOV. The strategy for achieving this is still under 
development, and is discussed in §3.6| 

Although high spatial resolution is essential for 
EMU, the short spacings of ASKAP also deliver 
excellent sensitivity to low-surface brightness emis- 
sion, which is essential for a number of science 
drivers such as studies of radio emission from nearby 
clusters (j |2.7[ ). The ~ 10/iJy/beam rms contin- 
uum sensitivity in 12 hours is approximately con- 
stant for beam sizes from 10 to 30 arcsec, then 
increases to ~ 20/xJy/beam for a 90 arcsec beam 
and ~ 40 /xJy/beam for a 3 arcmin beam. 

Science data processing ( |Cornwell et al.|2011| 
will take place in an automated pipeline processor 
in real time. To keep up with the large data rate 
(~ 2.5 GB/s, or 100 PB/year), all science data pro- 
cessing steps, from the output of the correlator to 
science-qualified images, spectra, and catalogues, 
are performed in automated pipelines running on 
a highly distributed parallel processing computer. 
These steps include flagging bad data, calibration, 
imaging, source-finding, and archiving. 

A typical ASKAP field will contain about 50 Jy 
of flux in compact or slightly resolved sources. ASKAP 
can observe the entire visible 20 cm continuum sky 
to an rms sensitivity of ~1 mJy/beam in one day, 
so that initial observations will produce a global 
sky model (an accurate description of all sources 
stronger than ~ 1 mjy) which significantly simpli- 
fies subsequent processing, as strong sources will 
be subtracted from the visibility data before pro- 
cessing. This sky model also means that antenna 
complex gains can be self-calibrated in one minute 
without any need to switch to calibrator sources. 
It is expected that individual receiver gains will be 
sufficiently stable that the dominant causes of an- 
tenna complex gain variation (i.e. ionosphere and 
troposphere) will be common to all pixels, so that 
a gain solution in one beam of an antenna can be 
transferred to other beams of that antenna. 

In continuum mode, ASKAP will observe a 300 
MHz band, split into 1 MHz channels, with full 
Stokes parameters measured in each channel. The 
data will be processed in a multi-frequency synthe- 
sis mode, in which data from each channel are cor- 
rectly gridded in the uv plane. As well as produc- 
ing images and source catalogues, the processing 
pipeline will also measure spectral index, spectral 
curvature, and all polarisation products across the 
band. 

Completion of the Boolardy Engineering Test 



Array (BETA) , which is a 6-antenna subset of ASKAP, 
is expected in late 2011. BETA will be equipped 
with 6 PAFs, and all the necessary beamformers, 
correlators, and processing hardware to produce 
images over the full 30 deg 2 field. The primary goal 
of BETA is to enable engineering tests and com- 
missioning activities while the remaining ASKAP 
hardware is being constructed. If engineering com- 
missioning proceeds as expected, science observa- 
tions on BETA will commence in 2012, on a small 
number of test fields on which good radio-astronomical 
and ancillary data already exist. These test BETA 
observations will be used to debug and fine-tune 
not only ASKAP, but also the processes for han- 
dling the data. 

The full ASKAP array is expected to be com- 
missioned in early 2013, and the science surveys 
are expected to start in late 2013. There is no 
proprietary period on ASKAP data, with all data 
being placed in the public domain after quality 
control, so data are expected to start flowing to 
the astronomical community by the end of 2013. 

Expressions of Interest for ASKAP survey projects 
were sought in November 20 08, and full prop osals 
were solicited in mid-2009 (IBall et al.||2009|). Of 



the 38 initial expressions of interest, ten propos- 
als were eventually selected, with two, EMU (Evo- 
lutionary Map of the Universe) and WALLABY 
(Wide-fiel d ASKAP L-band Lega cy All-sky Blink 

2011[ ), being selected 



SurveY: Koribalski et al. 



as highest priority. EMU is an all-sky continuum 
survey whilst WALLABY is an all-sky survey for 
neutral hydrogen. ASKAP design is now being 
driven by the requirement to maximise the science 
return from these ten projects, with a particular 
focus on maximising the science from EMU and 
WALLABY. 

It is planned that EMU, WALLABY, and some 
other projects will observe commensally, i.e., they 
will agree on an observing schedule, and will ob- 
serve the sky in both continuum and HI modes 
at the same time, splitting the two data streams 
into two separate processing pipelines. More in- 
formation on all the ASKAP projects, including 
links to their individual websites, can be found on 



http://askap.org 



1.3 EMU 

The primary goal of EMU is to make a deep (10 
/iJy/beam rms) radio continuum survey of the en- 
tire Southern Sky, extending as far North as +30° . 
EMU will cover roughly the same fraction (75%) 
of the sky as th e benchmark NVSS survey (Con- 
don et aL||1998 ), but will be 45 times more sensi- 
tive, and will have an angular resolution (10 arc- 
sec) 4.5 times better. Because of the excellent 
short-spacing uv coverage of ASKAP, EMU will 
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Figure 2 A representation of the EMU sky cov- 
erage in Galact ic coordinates ove rlaid on 23 GHz 
WMAP data (iGold et al.||2011|). The dark area 



in the top left is the part of the sky NOT covered 
by EMU. 



aL |2008a| |Hales et al.||2011| |Banfield et aL]|2011[ ) . 

Surveys at this depth extend beyond the tradi- 
tional domains of radio astronomy, where sources 
are predominantly radio-loud galaxies and quasars, 
into the regime of star-forming galaxies. At this 
depth, even the most common active galactic nu- 
clei (AGN) are radio-quiet AGNs, which make up 
most of the X-ray extragalactic sources. As a re- 
sult, the role of radio astronomy is changing. Whereas 
most traditional radio-astronomical surveys had 
most impact on the niche area of radio- loud AGNs, 
current radio- astronomical surveys are dominated 
by the same galaxies as are studied by optical 
and IR surveys, making radio-astronomical sur- 
veys such as EMU an increasingly important com- 
ponent of multi-wavelength studies of galactic evo- 
lution. 



also have higher sensitivity to extended structures. 
The sky coverage of EMU is shown in Figj2j and 
the EMU specifications are summarised in Table 
[TJ Like most radio surveys, EMU will adopt a 5- 
g cutoff, leading to a source detection threshold 
of 50 /iJy/beam. EMU is expected to generate a 
catalogue of about 70 million galaxies, and all ra- 
dio data from the EMU survey will be placed in 
the public domain as soon as the data quality has 
been assured. 

Table 1 EMU Specifications 



Instantaneous FOV 
Area of survey 

Synthesised beamwidth 
Frequency range 
Rms sensitivity 
Total integration time 
Number of sources 



30 deg 2 
entire sky south 
of +30° dec. 
10 arcsec FWHM 
1130-1430 MHz 
10 /iJy/beam 
~ 1.5 years 1 
~ 70 million 



1 The primary specification is the sensitivity, 
rather than the integration time. If for any reason 
ASKAP is less sensitive than expected, EMU will 
increase the integration time rather than lose 
sensitivity. Conversely, an increase in sensitivity 
of ASKAP may reduce the total integration time. 



Currently, only a total of about 5deg 2 of the 
sky has been surveyed at 20 cm to the planned 
10/iJy/beam rms of EMU, in fields such as the 
Hubble, Chandra, COSMOS and Phoenix deep fields 



2006 



(|Huynh et al 2005 M iller et al.| 2008| |Schinnerer 
et al 1120071 |Hopkins et al ; ||2003| |Biggs fc Ivison 



Morrison et al.||2010| ), with a further 7 deg 2 



expected in the immediate future as part of the 
ATLAS survey ( |Norris et aL|2006| |Middelberg"et 
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Figure 3 Distribution of differential radio source 
counts at 1.4 GHz, based on and updated from 
the distribution shown in Hopkins et al. ( |2003[). 
The solid c ur ve is the polynomial fit from Hop- 
kins et al. (20031, the dashed curve is an up- 



dated polynomial fit and is the one used to esti- 
mate the EMU source numbers. The horizontal 
dot-dashed line represents a non-evolving popula- 
tion in a Euclidean universe. The shaded region 
shows the prediction based on fluctuations due to 
weak confusing sources 



Condon 


( 


1974 


); 


Mitchell & Condon. 


(1985) 



Because only a small area of sky has been sur- 
veyed to the depth of EMU, it is difficult to es- 
timate precisely how many galaxies it will detect. 
Most surveys to this sensitivity cover only a small 
area of sky, so that source counts at this level 
are significantly affected by sample variance, com- 
pleteness, and bias issues. Our estimate for the 
number per deg 2 above a flux density of 50 /i/Jy/beam 
is based on an extrapolation from source cou nts at 
higher flu x densities (2263 sources/ deg 2 ; |Jack-| 
son |2 005 ) , the compilation shown in Fig. [3] (2278 
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and the COSMOS survey (2261 



Scoville et al. 



2007 



Schinnercr ct 



al. 2007J) . These three figures are in good agree- 
ment and predict a total of ~ 70 million sources 
in EMU, which is therefore the number adopted 
throughout this paper. 

Estimating the fraction of these radio sources 
which are AGN is difficult. Below 1 mJy, star- 
forming galaxies start to become a major compo- 
nent of the 1.4 GHz source counts, dominating be- 
low ~ 0.15 mJy (Seymour et al. 2008 Ibar et 



al. 20091, but, even at these levels, there is still 



a significant proportion of low-luminosity AGNs 
Oarvis et al.||2004l lAfonso et al. 1120051 120061 |Nor 



Seymour et al. 



ris et al.||2006 Simpson et al. [2006 Smolcic et al 



2008 |Seymour et al.||2008| |Mignano et al.||2008 
Padovani et al.||2009[ )' 



( |2008[ ) have presented the 
most comprehensive attempt so far to divide radio 
sources into AGN and SF galaxies, and their re- 
sult, together with other recent estimates, is shown 
in Fig. [4] From these we estimate that about 75% 
of EMU sources will be star-forming galaxies. 
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Figure 4 Differential fraction of star- forming galax- 
ies as a function of 1.4 GHz flux density, from a se- 
lection of recent deep surveys. Shaded boxes, and 
the two lines for Padovani et al., show the range 
of uncertainty in the survey results. Arrows indi- 
cate constraints from other surveys. These results 
show that the fraction of star-forming galaxies in- 
creases rapidly below 1 mJy and, at the 50 /iJy 
survey limit of EMU, about 75% of sources will be 
star-forming galaxies. 



To estimate the redshift distribution of AGN 
and SF galaxies, we use the SKADS simulation 
( |Wilman et al.|2008[|20Tol ), shown in Fig. [5] About 
50 million of the EMU sources are expected to be 
star-forming galaxies (see ( 2.1 1 at redshifts up to z 
~ 3, with a mean redshift of z ~ 1.08. The remain- 
der are AGNs with a mean z ~ 1. 88, and extend up 
to z ~ 6. However, if any FRII (|Fanaroff fc Riley 



- 3.3 x 10 25 W.ffz -1 at z = 10), EMU will detect 
them. 

Confusion of radio sources, discussed more thor- 
oughly in §3.6. 1[ is well-understood at this level, 
since previous surveys have already imaged small 
areas of sky to this depth and beyond. 



i- 1* 




Figure 5 Expected redshift distribution of EMU 
sources, based on the SKADS simulations 
( |Wilman et al.|2008[[2010| . The five lines show the 
distributions for star-forming galaxies (SFG), star- 
burst galaxies(SB), radio-quiet quasars (RQQ), 
and radio-loud galaxies of Fanaroff-Riley types I 
and II (FRI & FR2; |Fanaroff & Riley |[l974| . 



Vertical scale shows the total number of sources 
expected to be detected by EMU. 



EMU differs from many previous surveys in 
that a goal of the project is to cross-identify the 
detected radio sources with major surveys at other 
wavelengths, and produce public-domain VO-accessible 
catalogues as "value-added" data products. This 
is facilitated by the growth in the number of large 
southern hemisphere telescopes and associated planned 
major surveys spanning all wavelengths, discussed 



below in £ 3.9 



1974| galaxies exist beyond that redshift (e.g. L 



1.4 Science 

Broadly, the key science goals for EMU are: 

• To trace the evolution of star-forming galax- 
ies from z — 2 to the present day, using a 
wavelength unbiased by dust or molecular 
emission, 

• To trace the evolution of massive black holes 
throughout the history of the Universe, and 
understand their relationship to star forma- 
tion, 

• To use the distribution of radio sources to 
explore the large-scale structure and cosmo- 
logical parameters of the Universe, and to 
test fundamental physics, 
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• To determine how radio sources populate dark 
matter halos, as a step towards understand- 
ing the underlying astrophysics of clusters 
and halos, 

• To create the most sensitive wide-field at- 
las of Galactic continuum emission yet made 
in the Southern Hemisphere, addressing ar- 
eas such as star formation, supernovae, and 
Galactic structure, 

• To explore an uncharted region of observa- 
tional parameter space, with a high likeli- 
hood of finding new classes of object. 

Table 1 gives an overview of EMU specifica- 
tions. In addition to the well-defined scientific 
goals outlined above, and the obvious legacy value, 
the large EMU dataset will include extremely rare 
objects, only possible by covering large areas. 

A challenge for EMU will be the lack of spectro- 
scopic redshifts, since no existing or planned red- 
shift survey can cover more than a tiny fraction of 
EMU's 70 million sources. As discussed in §3.11[ 
~ 30% of EMU sources will have multi-wavelength 
optical/IR photometric data at the time of data 
release, increasing to ~ 70% in 2020. We expect 
these to provide accurate photometric redshifts for 
the majority of star-forming galaxies in EMU, and 
a minority of AGN (for which photometric red- 
shifts tend to be unreliable). In addition, many of 
the EMU sources will have "statistical redshifts", 
which are valuable for some statistical tests. For 
example, most polarised sources are AGNs (mean 
z ~ 1.88), while most unpolarised sources are star- 
forming galaxies ( mean z ~ 1.08). More pre- 
cise statistical redshifts can be derived where op- 
tical/IR photometry is available, as discussed in 

Em 

A further goal of EMU is to test and develop 
strategies for the SKA. Many aspects of ASKAP, 
such as the automated observing, calibration, and 
data reduction processes, and the phased-array feeds, 
are potential technologies for the SKA, and it will 
be important to test whether these approaches de- 
liver the planned results. 

1.5 Relationship to other surveys 

The following radio surveys are particularly com- 
plementary to the scientific goals of EMU. 



The WO DAN survey ( Rottgering et al. 2010b \ 



has been proposed for the Westerbork tele- 
scope which is curren tly being upgraded wit h 
a phased array feed ( |Oosterloo et al. |2009 ) . 
WODAN will cover the northern 25% of the 
sky (i.e. North of declination +30°) that 



is inaccessible to ASKAP, to an rms sen- 
sitivity of 10 /iJy/beam and a spatial res- 
olution of 15 arcsec. Together, EMU and 
WODAN will provide full-sky 1.3 GHz imag- 
ing at ~ 10-15 arcsec resolution to an rms 
noise level of 10 /iJy/beam, providing an un- 
precedented sensitive all-sky radio survey as 
a legacy for astronomers at all wavelengths. 
The WODAN survey will overlap with EMU 
by a few degrees of declination to provide 
a comparison and cross-validation, to ensure 
consistent calibration, and to check on com- 
pleteness and potential sources of bias be- 
tween the surveys. 



The LOFAR continuum survey (Rottgering 



et al. 2010a) will cover the northern half of 
the sky (i.e. North of declination 0°) with 
the new LOFAR telescope operating at low 
frequencies (15-200 MHz). LOFAR will be 
especially complementary to WODAN and 
EMU in surveying the sky at high sensitiv- 
ity and resolution but at a much lower fre- 
quency. 



The MIGHTEE survey (van der Heyden 



& Jarvis 2010) on the Meerkat telescope 
( |Jonas||2009[ ) will probe to much fainter flux 
densities (0.1-1/zJy rms) over smaller areas 
(~ 35 square degrees) at higher angular reso- 
lution, providing the completeness as a func- 
tion of flux density for the EMU and WODAN 
Surveys. The higher sensitivity and resolu- 
tion will enable exploration of the AGN and 
star-forming galaxy populations to higher red- 
shifts and lower luminosities. 



POSSUM (Gaensler et al. 20101 is an all 



sky ASKAP survey of linear polarisation. It 
is expected that POSSUM will be commen- 
sal with EMU, and that the two surveys will 
overlap considerably in their analysis pipelines 
and source catalogues. POSSUM will pro- 
vide a catalogue of polarised fluxes and Fara- 
day rotation measures for approximately 3 
million compact extragalactic sources. These 
data will be used to determine the large-scale 
magnetic field geometry of the Milky Way, 
to study the turbulent properties of the in- 
terstellar medium, and to constrain the evo- 
lution of intergalactic magnetic fields as a 
function of cosmic time. POSSUM will also 
be a valuable counterpart to EMU, in that it 
will provide polarisation properties or upper 
limits to polarisation for all sources detected 
by EMU. 



FLASH ( |Ball et al.|2009| is an ASKAP sur- 
vey whose goal is to detect extragalactic neu- 
tral hydrogen absorption. To do so it will 
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observe at frequencies outside the 1130-1430 
MHz band of EMU, thus yielding valuable 
spectral index information for those sources 
common to both surveys. 



DINGO ( |Ball et al. 12009] ) is an ASKAP sur- 
vey whose goal is to detect faint extragalac- 
tic neutral hydrogen emission, and to do so it 
will spend many days on one ASKAP point- 
ing. As a by-product, it will thus provide 
sensitive continuum images over smaller ar- 
eas (several tens ofdeg 2 ), allowing EMU to 
explore fainter flux densities in an optimal 
tiered survey structure, and also to quantify 
the effects of confusion at this level. How- 
ever, the continuum images from DINGO will 
be severely confusion-limited at flux densi- 
ties below a few /zJy/beam. It may be pos- 
sible to transcend this limit by subtracting 
known sources from the image, such as those 
star-forming galaxies which are seen in in- 
frared images and whose radio flux can be 
predicted using the IR-radio correlation. How- 
ever, this challenge is currently external to 
the core EMU project. 



VAST ( Chatterjee et al. 2010 1 is an ASKAP 



survey that will observe partly commensally 
with EMU, with the goal of detecting tran- 
sients and variable sources. EMU has no 
planned transient capability, since all infor- 
mation on variability of EMU sources will 
be available from VAST. This separation en- 
ables each of EMU and VAST to focus on 
its specific science goals, although significant 
coordination between the projects will clearly 
be essential. 



• WALLABY ( |Koribalski et^[20TT| ) is an 
HI survey which will deliver high-sensitivity 
spectral line data over the same area of sky 
as EMU, and will observe commensally with 
EMU. Observations will give a velocity cov- 
erage of -2,000 to +77,000 km s^ 1 (z = - 
0.26) and velocity resolution of 4 km s -1 . 
The angular resolution for WALLABY will 
be 30 arcsec, a factor three lower than EMU, 
as computing resources to make the large 
spectral line data cubes are restricted to base- 
lines shorter than ~2 km. Nearly all the 
~ 5 x 10 5 sources detected by WALLABY 
will also be detected by EMU, and WAL- 
LABY will provide an HI redshift for each 
of these, adding significantly to the redshift 
information for low-redshift EMU sources. 

This paper defines the EMU survey, setting out 
its science goals in § 2, and identifying the chal- 
lenges to achieve these goals. § 3 describes how 



these challenges are being addressed in the EMU 
Design Study, and § 4 describes the survey opera- 
tional plan, primary data products, and the data 
release plans and policy. 

2 EMU Science Goals 

2.1 Star- forming galaxies and AGNs 

The fraction of star-forming galaxies as a function 
of flux density is shown in Figure [4] Of the ~ 70 
million sources detected by EMU to a 5 a limit of 
50 ^Jy, about 20 million galaxies are expected to 
be dominated by Active Galactic Nuclei (AGN), 
and 50 million to be dominated by star formation 
(SF). However, there is considerable overlap be- 
tween the two classes, with composite AGN/SF 
galaxies becoming more common at low flux den- 
sities (e.g. Chapman et al. 2003 Norris et al.|2006 



Seymour et al.||2008| ) 

It is unclear what fraction of putative SF galax- 
ies have a significant AGN component. However, 
the excellent agreement in star-formation rate be- 
tween radio and other star-forming indicators (e.g. 
Cram et all 19981 IBell 120031) suggests that an AGN 



is not a major contributor to the radio emission in 
such galaxies. 

Detected AGNs and star-forming galaxies span 
a significant fraction of the age of the Universe, 
almost reaching the era of re-ionisation for radio 
AGNs and the most extreme starbursts. Partic- 
ularly at high redshift, both AGN and star for- 
mation processes are likely to be important in a 
large fraction of galaxies, but neither the fraction 
of the luminosity (bolometric and radio) generated 
by each process, nor how they are influenced by 
feedback, is currently known. EMU will explore 
the evolution of these populations and its depen- 
dence on galaxy mass, environment, SF history, 
and interaction/merger history. It will quantify 
these effects in detail, by providing a deep homo- 
geneously selected sample of both AGN and SF 
galaxies over the majority of cosmic history, unbi- 
ased by dust obscuration, and so provide a com- 
prehensive overview of galaxy evolution. 

The EMU analysis pipeline, which will encom- 
pass automated multi- wavelength cross-identification 
of sources between the EMU catalogue and other 
complementary surveys, will also include a vari- 
ety of measures appropriate for distinguishing be- 
tween, and quantifying the proportions of, AGN 
and star-forming activity. These will include: 



Radio morphology (e.g. Biggs fc Ivison 2008 



Biggs, Younger fc Ivison ||2010 |, 



• Radio spectral index (e.g. Ibar et al. 2009 
20101, 
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• Radio-far-infrared ratio, 

• Radio-near-infrared ratio, 



• Radio polarisation, fro m the POSSUM project 
( [Gaensler et al.||2010| ), 



• Radio variability, from the VAST project ( CI .at- 
terjee et al~||2010[ ), 



• optical and IR colours, to be used in SED 
analysis and comparison with templates. 

These diagnostics and the algorithm which encodes 
them will be trained prior to the EMU data using 
the ATLAS data ( |Mao et al.||2011b |. 



2.2 



Evolution of star-forming 
galaxies 



Tracing when and where stars formed across cos- 
mic time is one of the key questions in galaxy evo- 
lution today. There is now evidence that star for- 
mation depends both on galaxy stellar mass (iFeul- 



ner et al.|2005 Juneau et al. 2005; Papovich et al. 


200G 


Mobasher et al.||2009 see Figure 7 


1 and en- 



and these depen dencies also evolve with time ( El- 
baz et aL||2007 ). Massive galaxies appear to form 
their stars early and quickly, progressively becom- 
ing less active after z ~ 2, while lower mass galax- 
ies become dominant at lower redshifts. As a re- 
sult, the cosmic SFR density is dominated by pro- 
gressively less massive galaxies in less dense envi- 
ronments at lower redshifts. This process is known 
as cosmic "downsizing" (Cowie et al. 1996[), al- 



thoug h it has been challenged by r esults from |Zhen^ 



et al 



(2007] and punne eTal] p009 | 



EMU, with 50 million SF galaxies and a large 
collection of complementary data, will accurately 
determine the true dependence of star formation 
on mass, environment, and redshift, in all types of 
galaxy from normal star-forming galaxies up to the 
most UltraLuminous InfraRed Galaxies (ULIRGs) . 
Radio observations have a considerable advantage 
over most other diagnostics of star formation as 
radio luminosities are directly related to star for- 
mation rates ( Condon |1992 Haarsma et al.|2000 



Bell 2003) through a calibration reliable at least 
up to z ~ 2 (Garn et al. 2009[ ), and no correc- 



tions for absorption by gas or dust are required. 
However, to measure star formation rates from ra- 
dio luminosities requires that any AGNs are re- 
moved from radio samples, and this can largely 
be achieved using a variety of indicators, as dis- 
cussed in §2.1| For tracing evolution, EMU sources 
will necessarily be limited to those for which red- 
shifts can be measured or estimated, as discussed 
in [3.11 Fortunately, photometric redshifts prove 



to be very accurate fo r SF galaxies (see e.g. Rowar 
Robinson et al.|[2008 1 and a large fraction of star- 
forming galaxies detected by EMU will eventually 
have reliable photometric redshifts. 




Figure 6 Star formation rate (in M /yr) of in- 
dividual star-forming galaxies detectable by EMU 
at 5 ct as a function of redshift. The calculation of 



SFR follows Bell (2003) modified following Sey 



mour et al. ( |2008 1, and the k-correction assumes 



a spectral index of -0.7 



EMU's surface brightness sensitivity of 0.06 K 
is well below the median face-on surface bright- 
ness of a typical spiral galaxy of ~ 1 K at 1.4 GHz. 
So, unlike previous large radio surveys, EMU will 
have the brightness sensitivity needed to detect 
nearly all resolved normal spiral galaxies. Lumi- 
nous starburst galaxies, with star formation rates 
(SFRs) around 100 M^r" 1 , will be detectable to 
z ~ 2 (see Fig. [6]) , while ordinary disk galaxies like 
the Milky Way, with SFRs of only a few M Q yr~\ 
will be visible to z ~ 0.3, and many of these will 
have spectroscopic redshifts measured by WAL- 
LABY. Star-formation rates of classes of galaxy 
at higher redshifts can be estimated using stacking 
(see [3.10), provided that AGNs have been elimi- 



nated from the stacked sample. 

EMU will therefore trace the co-moving SFR 
density (e.g.,|Lilly et al.|1996| |Madau et al.||1996 



Hopkins ||2004| |Hopkins fc Beacom ||2006[ ) up to 

high redshift with greater accuracy than previ- 
ously possible, and thereby determine when most 
of the stars in the Universe formed. The wide area 
of this survey will overcome the problems of low 
number statistics and sample variance sometimes 
associated with deep but small-area radio surveys 
( |Hopkins et~aT|2003[ ) . 



2.3 Evolution of AGNs 

AGNs play a major role in the framework of galaxy 
formation. During their short active lifetime, they 
release an enormous amount of energy in the form 
of ionising radiation or relativistic jets, which can 
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Figure 7 Evolution with redshift of the star for- 
mation rate density (SFRD) in galaxies, p*. The 
grey and shaded regions in both panels are the 1-tr 
and 3-<7 confidence region fits to the com p ilation 
of SFRD data from ~~ 



Hopkins & Beacom (2006). 



T op: M easu red SFRD taken from|Mauch fc Sadler 



| ( [2007] and |Seymour et al.| ( |2008[ ). Bottom: The 



SFRD shown as a function of ga l axy ma ss, adapted 
from Fig. 7 of iMobasher et al.l (120091). 



have a significant effect on the host galaxy and its 
surroundings. 

The peak of QSO activity took place at z~2 
(e.g. |Schmidt ||1968[ |Shaver et al.|[l996l |Croomet 



al.|2004| |Hasinger et al.||2005[ ), "at epochs when SF 



activity was also extreme. Intriguingly, embedded 
AGNs have been found in 20-30% of z~2 massive 
star-forming galaxies (Daddi et al. 2007). It has 



even been suggested that the AGN jets could be 
responsib le for the formation of galaxies at high 
redshift ( |Elbaz et al.||2009| |Klamer et al.||2004[ ). 

At lower redshifts, AGNs appear to downsize, 
in a similar way to that of SF galaxies, so that the 
peak of the AGN activity appears to shift signif- 
icantly to lower redshifts for lower power AGNs. 
This suggests (|Cowie et al.||1996|) that a feed- 



back mechanism couples AGNs to galaxy evolu- 



tion (Hasinger et al. 


2005 


Bongiorno et al. 


2007 


Padovani et al. 2009 


, via 


supernovae, star 


Hirsts 



or AGNs ( |Croton et al]|2006| |Bower et al!|2006| ) 



AGN jets either push back and heat the infalling 
gas, reducing the cooling flows building up the 
galaxy, or shock- heat and collapse the gas clouds, 
inducing star formation. The AGN energetic feed- 
back appears to be an important ingredient for re- 
producing the galaxy stellar ma ss function (Cro- 
ton et ah] 2006 Best et aL||2006 ), and the remark- 



able black hole vs. bulge mass (or velocity disper- 
sion) correlation ( Gebhardt et al.||2000 Ferrarese 
fc Merritt pOOO} |Springel et al.||2005[ K 

Recent work work on low-z 3CR galaxies by 



Ogle et al. ( 2010 ) suggest that radio jets can lead 
to significant shock-heating of the host molecular 
disk, leading to a significant enhancement of warm 
molecular hydrogen emission. Despite their large 
molecular masses, these systems seem to be very 
inefficient at making stars, suggesting that shock- 
heating by radio jets can play a negative-feedback 
role in star formation in these systems. 

Low power radio-loud AGNs (P< lO^WHz- 1 ) 
appear quite different from the high-power radio- 
loud AGNs. Many do not show the luminous nar- 
row lines expected in the framework of AGN uni- 
fication (see e.g. 



Hine fc Longair 



et al.||1994| | Jackson fc Rawhngs||1997[ ). They also 



1979 Laing 



lack the expect ed infrared emission from a dusty 
Whysong fc Antonucci|2004| |Ogle 



torus (see e.g. 



et al. 2006) a nd do not show accretion related X- 
ray e mission ( Hardcastle et al.]|2006 Evans et al. 



2006). The sum of this observational evidence sug 
gests that low power radio sources correspond to 
a distinct phase of AGN, accreting through a ra- 
diatively inefficient "radio mode" , rather than the 
radiatively efficient accretion "quasar m ode" typi- 
cal of optical or X-ray selected AGNs (Croton et 



al 



be 



2006| Bower et al. 2006J) . The physical reasons 



rind these two different accretion modes are still 



uncle ar (e.g. Hardcastle et al.|2007 Herbert et al 



2010) 



The key questions related to AGN evolution 
that EMU will address include: (1) The relation- 
ship between AGN and SF activity; (2) The evolu- 
tion of low and intermediate power AGNs, explor- 
ing the so-called "AGN cosmic downsizing" sce- 
nario, (3) The relative contribution of different ac- 
cretion regimes (radio vs. quasar modes) in low 
and intermediate luminosity radio AGNs, its evo- 
lution with redshift, and the role played by the 
environment; (4) The relative contribution of ra- 
diative versus jet-driven (kinetic) feedback to the 
global AGN feedback in models of galaxy forma- 
tion; and (5) the evolution of radio galaxies as a 
function of environment, spectral index, redshift, 
radio luminosity, and source size. 
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2.4 High-redshift AGNs and the 
Epoch of Re-ionisation 

There have been many at tempts to find very high 
redshift radio galaxies (De Breuck et al. 2001 



Jarvis et al.||2001a|b| [Best et al.||2003[ |Cruz et~al 



2006 



2007) but all suffer from the difficulty of find- 



ing these extremely rare galaxies. EMU will iden- 
tify numerous radio galaxies at z>4 and isolate 
the most distant objects by cross-matching with 
large near- and mid-infrared surveys, as described 
in §3.9| EMU will also provide spectral indices and 
(with POSSUM) polarisation information which 
can also help identify high-redshift AGNs. 

High-redshift AGNs from EMU will constrain 
the existence and demographics of the most mas- 
sive galaxies over the history of the Universe (e.g. 
Jarvis et aLl|2001b|c| |Wall et al.||2005[ ). These 
sources provide constraints on the co-evolution of 
galaxy bulges and central supermassive blac k holes 
( IMagorrian et al.||1998| |McLure et al.||2006D, trace 



(prot o-)clusters at early times (e.g. |Miley et al 



2006 ) , and may be crucial in determining the im- 



pact that p owerful radio activity has on the host 
galaxy (e.g. Croton et al.|2006 Bower et al.|2 006 ) 



and its larger scale environment ( Rawlings & Jarvi s 
l|2004l |Gopal-Krishna, fc Wiita ||2001[ |Elbaz et al 
20091. 



Infrared-Faint Radio Sources (IFRS) are prob- 
ably a particular class of radio source that are 
characterised by a very high radio-infrared ratio 
(5 , 20cm/S , 3.6 A1 m > 500) and a low infrared fl ux den - 
sity. First identified by |Norris et al .| (|2006|), 



there is strong circumstantial evidence that they 
are high-redshift radio galaxies, based on their SEP 



( |Garn & Alexander ||2008| |Huynh et aL||2 010) 
their steep spectral index (|Middelberg et al. 
201 1[), their VLBI core s ( |Norris et al. ||2007 



Middclb erg et al |2008b I , and their extrem e faint 



ness in the infrared {S^.^^m < 1.2^iJy; 

Observations suggest ~ 5 
occur per deg 2 at a flux limit of ~ lOO^Jy 



Norris 



FRS' 



(Nor- 



ris et al. 2006 



Middelberg et al.||2008a ), implying 



that EMU will detect at least 1.5 x 10 5 , but the 
lack of corresponding deep infrared data will pre- 
vent their identification from other unremarkable 
non-detections. However, many will be located 
in smaller deep infrared fields such as those ob- 
served with HERMES ( [Oliver et aLpH0| ), which 
should yield several thousand IFRS's, enabling us 
to compile solid statistical data on their distribu- 
tion, spectral index, and polarisation. Thousands 
more will be selected as candidate high-redshift 
AGNs thro ugh t heir steep spectrum and polari- 
sation (see ^3.11), and it expected that they will 



turn out to be an important class of high-redshift 
AGN. 

A major objective of current extragalactic as- 



tronomy is to understand the Epoch of Reionisa- 
tion (EoR) , when ultraviolet photons from the first 
stars and quasars ionised the primordial neutral 
hydrogen. This process can in principle be stud- 
ied by measuring the neutral hydrogen fraction in 
the early Universe using the 21 cm forest in front of 
a bright distant source ( Carilli et al.|2002 ), analo- 
gous to the Ly-a forest seen agai nst bright quasars 
at lower redshifts (e.g. Peroux et al. 2005). Such 



observations require a population of z>6 radio- 
loud background sources, but the highest redshift 
radio galaxy currently known lies at only z=5.2 
van Breugel et al. 1999). EMU provides an ex- 



cellent opportunity to identify such sources, pro- 
ducing a large sample of distant radio sources for 
investigating the formation and evolution of the 
most massive galaxies at the highest redshifts. 

2.5 CSS and GPS sources 

Compact Steep Spectrum (CSS) sources are typ- 
ically defined to be radio sources with a spectral 
index < —0.5, and with a size less than about 20 
kpc, and hence of sub-galactic dimensions. Gi- 
gahertz Peaked Spectrum (GPS) sources are ra- 
dio sources whose spectrum reaches a maximum 
in the frequency range 1-10 GHz. Typically they 
are significantly smaller (hundreds of pc) than CSS 
sources, and confined to the circumnuclear regions 
of the host galaxy. 

Approximately 10% and 30% of bright centime- 
tre wavelength sources are GPS and CSS sources 
respectively. If similar statistics hold for AGNs at 
low flux densities, EMU will discover ~ 2 million 
GPS sources and ~ 5 million CSS sources. Most 
studies of CSS and GPS objects have been con- 
fined to sources selected from strong source sur- 
veys, and current sa mples only probe down to ~ 
10m Jy at 1.4 GHz (fTschager et al.||2003| [Snellen" 



et al. 1998 1999 Randall et al. 20111. EMU will 



offer a complete, faint sample of these intriguing 
objects, providing a probe into the evolution of 
young radio AGNs, and showing how these objects 
fit into models of galaxy evolution. 

There is a consensus (O'Dea 1998 Fanti 



2009a|b[ [Morganti et al.||2009| |Snellen et al.||2009[ ) 

that GPS and CSS sources represent the start of 
the evolutionary path for large-scale radio sources. 
It is generally accepted that most GPS sources 
evolve into CSS sources, which gradually trans- 
form into the largest radio sources known, Fanaroff- 
Riley Type I and II galaxies ( |Fanaroff k. Riley~| 
1974 ) , depending on their initial luminosity. These 



sources offer an ideal resource to investigate early 
galaxy evolution and formation, as well as AGN 
feedback, as they are young AGNs but also have 
star formation occurring due to interactions and 
mergers (Labiano et al. 2008 Morganti 2008). 
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Measurements of component advance speeds for 
a few compact sources yield ages of about 10 3 yr, 



10 5 yr ( 


0'Dea|l998 


Owsianik et al.|l999 


Murgia 


et al. 


19990. 



infall of gas to the supermassive black holes, trig- 
gered by interactions with companions and merg- 
ers. CSS and GPS sources show evidence, from 
both their structural and polarisation asymmetries, 
that the jets are interacting with an asymmetric 



external environment (Saikia et al. 2003 Cotton 



et al.|[2003] ) . CSS and GPS sources show evidence 



of Hi absorption more often than other radio-loud 
AGNs, with the GPS objects showing the highest 
incidence of absorption (Gupta et al. 2006 and 



references therein). The EMU sample of CSS and 
GPS sources over a large redshift range will en- 
able the variation of the Hi properties with cosmic 
epoch to be determined. 

Several CSS and GPS objects show evidence of 
diffuse extended emission which may be a remnant 
of an earlier cycle of activity. An important goal of 
AGN physics is to understand the episodic nature 
of nuclear or jet activity ( |Saikia et al.|2009| ). EMU 
with its high surface brightness sensitivity will be 
an ideal survey to probe the existence of such dif- 
fuse emission for a large number of sources, and 
constrain the time scales of episodic activity for 
these compact objects. 



mil 



Figure 8 This multi-wavelength composite image 
of the inner part of the spiral galaxy M 83 high- 
lights the synergy between the two key ASKAP 
large survey science projects: WALLABY and 
EMU. The ATCA 20-cm radio continuum emis- 
sion is shown in red, the ATCA H I distribution 
in green, and the GALEX FUV emission in b lue 
( |Koribalski et al.|[2008j [Gil de Paz et"aT~|[2007l ) . 



2.6 Low redshift galaxies, and syn- 
ergies with WALLABY 

WALLABY is expected to detect HI emission from 
~ 5 x 10 5 galaxies to a depth of z = 0.05, with mas- 
sive galaxies detected out to z = 0.25. The ma- 
jority of these galaxies will be spiral, with typical 
Hi masses of a few times 10 9 M©. Their large gas 
reservoir fuels star formation, implying that all spi- 
ral galaxies detected by WALLABY will have 20- 
cm radio continuum emission detectable by EMU. 
As a result, WALLABY is expected to contribute 
- 5 x 10 5 redshifts to EMU. 

In return, EMU will be able to measure star 
formation rates of the galaxies studied by WAL- 
LABY, enabling detailed studies of the factors that 
influence star formation rates in the local Uni- 
verse. A goal of WALLABY is to measure local 
and global star formation (SF) rates for all gas-rich 
spirals and compare their SF and H I distributions 
(see Fig. [8]) . 

Although most WALLABY data will have a 30- 
arcsec resolution, high-resolution (10 arcsec) Hi 
'postage stamps' will be obtained of particularly 
interesting nearby galaxies, allowing a more de- 
tailed analysis and comparison with data at other 
wavelengths. 

2.7 Galaxy clusters 

Several different types of diffuse radio emission are 
associated with clusters of galaxies (Kempner et 



al. 2004), including haloes around the centres of 
clusters, relics (representing shocks from cluster- 
cluster collisions) at the periphery, and tailed radio 
galaxies which are an important barometer of the 
intra-cluster medium. These three classes of radio 
source are important as tracers of clusters, and are 
also diagnostics of the physics of clusters, particu- 
larly when combined with X-Ray data. However, 
the number of detected cluster radio sources is lim- 
ited by current telescope sensitivities (see Fig[9|. 
EMU will not only give us large samples of such 
sources, but will push beyond the present limits 
to detect diffuse sources with a range of powers 
over a larger redshift range, greatly improving our 
understanding of these sources. 

Most clusters have been found cither through 
X-ray surveys (Rosati et al. 1998; Romer et al. 
2001; Pierre et al. 2003) or by searching for over- 
densities in optical colour-position space (Gladders 
& Yee 2005; Wilson et al. 2008; Kodama et al. 
2007) . As a result, tens of thousands of clusters are 
currently known, but only a few at z> 1 (Wilson 
et al. 2008, Kodama et al. 2007), with the highest 
redshift at z = 2.07 (Gobat et al. 2011). 
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2.7.1 Haloes 

Radio haloes are found in clusters and groups of 
galaxies, indicating synchrotron emission powered 
by diffuse and faint (0.1 - 1 fiG) magnetic fields 
and relativistic particles. So far about 35 radio 
halos are known with z < 0.4 and only 2 at z ~ 
0.5, generally discovered by making deep radio sur- 



veys of hot and bright X-ray cl usters (Venturi et 
al.|2008 Giovannini et al.(20 09 ) . A strong correla- 



tion is present between the total halo radio power 



and the cluster X-ray luminosity (Giovannini et 



al. 2009 1 . Since the cluster mass and X-ray lu- 



minosity are correlated it follows that halo radio 



power correlates with the cluster mass ( Feretti 



2000 Govoni et al. 2001 


. Brunetti et al.| 


Cassano et al. 


piOp; 


Schuecker et al.j 


have suggestec 


that rac 


io haloes in the 



P00M 



(2001) 



of clusters are distributed bimodally, with haloes 
generally found only in those clusters which have 
recently undergone a merger, resulting in a dis- 
turbed appearance at X-ray wavelengths. 

The ATLBS survey ( jSubrahmanyan et al.|2010| ). 
which has surveyed 8.4deg z to an rms sensitivity 
of 80 /uJy/beam on a scale size of 50 arcsec at 
1.4 GHz, has detected tens of diffuse sources, of 
which about 20 have been tentatively identified 



as cl uster and group haloes (Saripalli et al. 
2011). EMU will have even better sensitivity to 
low-surface-brightness structures than ATLBS, so 
if the ATLBS numbers are confirmed, then EMU 
will discover ~ 6 x 10 4 cluster and group haloes, 
which significantly increases the number of known 
clusters. An important science goal will then be to 
compare the X-ray properties (luminosity, temper- 
ature and, for the brighter clusters, morphologies) 
of these radio-selected clusters to those of the X- 
ray selected population from the eR OSITA all-sky 
X-ray survey ( |Predehl et aL|2010[ ). 



2.7.2 Relics 

On the periphery of clusters, elongated radio "relics" 
are found, which probably represent the signatures 
of shock structures generated in cluster mergers 



(Rottgering et al. 1997 Brown et al. 2011a 



van 



Weeren et al. 2010). They provide important di- 



agnostics for the dynamics of accretion and merg 
ers by which clusters form ( Barrena et al. 2009 ) 



Large populations of these structures will appear 
in EMU, and are likely to lead to new cluster iden- 
tifications especially beyond z ~ 0.5. For example, 



only 44 radio relics are currently known ( Hoeft et 



al. 2011), and few have been discovered in current 
radio surveys because of the relatively poor sen- 
sitivity of most surveys to low-surface-brightness 
structures. One probable relic has been discov- 
ered in the seven square degrees of ATLAS ( Mid- 



delberg et al.|2008a||Mao et al.|2010[ ). Since EMU 
will have greater sensitivity to such low-surface- 
brightness structures than ATLAS, this suggests 
that EMU should detect > 4000 relics, although 
this number is clearly very uncertain. As a means 
of finding clusters, it is less effective than other 
radio and X-ray techniques, but will be invaluable 
for studying shock structures accompanying clus- 
ter mergers. Furthermore, relics show evidence of 
ordered large scale magnetic fields in the periph- 
ery of galaxy clusters, in regions with a very low 
density of galaxies and thermal gas. 

2.7.3 Tailed radio galaxies 

Head-tail, wide-angle tail (WAT), and narrow-angle- 
tail galaxies (collectively named "tailed radio galax- 
ies" ) are believed to represent radio- loud AGNs in 
which the jets are distorted by the intra-cluster 
medium ( |Mao et al.||2009| |2010[ ). T hey can also 
contribute to the diffuse emission (Rudnick & 



Lemmerman ||2009[ ) , especially after the jets from 
the nucleus of the host galaxy have turned off; 
EMU's high resolution and sensitivity, especially 
when combined with polarisation information from 



the c ommensal POSSUM survey ( jGaensler et al. 



2010), will allow us to distinguish these from the 
(largely unpolarised) cluster-wide halo emission. 
Even more importantly, such tailed galaxies can 



be detected out to high redshifts ( |Wing fe Blanton 
2011 Mao et al.|2010 ), providing a powerful diag 



nostic for finding clusters. From the W AT's discov- 
ered in the ATLAS fields, |Mao et all (|2011c|) and 



Deghan et al. (2011) have estimated that EMU 
will detect at least 26000, and possibly as many 
as 2 x 10 5 WATs, depending on their luminosity 
function and density evolution. 

2.7.4 AGN feed back in galaxy clusters and 
mini-halo sources 

The relatively cool and dense gas at the centres 
of many galaxy clusters and groups emits copious 
X-ray radiation by thermal bremsstrahlung and 
line emission. In the absence of external sources 
of heating, this high emission should lead to very 
rapid cooling (t coo / < 1 Gyr) and very high rates 
of mass deposition onto the central cluster galaxy 
(up to ^1000 M Q /yr), in turn causing very high 
star formation rates and strong X-ray line emis 
sion (e.g. |Fabian fc Nulsen 



ney||1977| |Peterson fc Fabian 



1977 Cowie & Bin- 



[20061 |McNamara & 



Nulsen||2007| . The lac k of such obvious observa- 
tional signatures (e.g. Peterson et al.|200T 2003 1 
implies that some central source of heating must 
be present. The most plausible source of heating 
is feedback from the central AGN. EMU will en- 
able a statistically significant study of the correla- 
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Figure 9 1.4 GHz radio power of detected clus- 
ter halos as a function of redshift showing the 
detection limi ts of previous clu st er obs ervations, 
adapted from Giovannini et al. (2009), and the 



calculated detection limit of EMU, assuming a 
halo with a diameter of 1 Mpc. The upper line 
shows the limit corresponding to a scale size of 
15 arcmin, which is approximately the largest size 
object that can be imaged with the VLA unless 
single-dish data are added to the interferometry 
data. 



tion between the radio emission from AGNs at the 
cluster centre with the thermal and non thermal 
cluster properties, and explore how it evolves with 
redshift. 

Moreover a few cooling-core clusters exhibit 
signs of diffuse synchrotron emission that extends 
far from the dominant radio galaxy at the cluster 
center, forming what is referred to as a mini-halo. 
These diffuse radio sources are extended on a mod- 
erate scale (typically ~ 500 kpc) and, in common 
with large-scale halos, have a steep spectrum and 
a very low surface brightness. Because of a combi- 
nation of small angular size and the strong radio 
emission of the central radio galaxy, the detection 
of a mini-halo requires data of a much higher dy- 
namic range and resolution than those in avail- 
able surveys. The study of radio emission from 
the center of cooling-core clusters is of large im- 
portance not only in understanding the feedback 
mechanism involved in the energy transfer between 



mara & Nulsen 2007) but also in the formation 
process of the non-thermal mini-halos. The en- 
ergy released by the central AGN may also play a 
role in the formation of these extended structures 
(e.g. |Fujita et al.||2007| . 



2.7.5 The impact of EMU on cluster re- 
search 

The key impacts of EMU are likely to be: 

• To increase the number of known clusters 
beyond the few tens of thousands currently 
known. EMU will detect at least 3 x 10 4 



new clusters (Mao et al. 


2011c 




Norris et 


al. 


2011c 


Deghan et al. | 


20111, which will 



roughly double the number of known clus- 
ters. Depending on the redshift distribu- 
tion and luminosity function of cluster radio 
sources, EMU may d etect as many as ~ 2 x 
10 5 cluster sour ces ( Mao et aL]|2011c Nor- 



ris et al.||2011c ). eRosita is also expected to 



detect ~ 10° clusters at X-ray wavelengths 



(Predehl et al. 2010 Pillepich et al. 2011) 



and comparison of these two complementary 
surveys will be transformational. 

• To detect clusters at high redshifts. In prin- 
ciple, radio sources can be used to detect 
clusters even beyond z=l, where current con- 
straints on large-scale structure are weaker, 
and traditional detection techniques like X- 
ray surveys and use of the Red Cluster Se- 
quence (Gladders & Yee 2005) become less 
effective. A few clusters have already been 
detected up to high redshift using radio de- 
tections (Blanton et al. 2003, Wing et al. 
2011), but extrapolation beyond z=l is un- 
certain because the luminosity function and 
evolution of cluster radio sources are unknown. 
Furthermore, inverse Compton cooling of elec- 
trons by the cosmic microwave background is 
expected to quench their synchrotron radio 
emission at z 3> 1, although confidence in 
this expectation is reduced by the failure to 
detect a similar effect in the far-IR-radio cor- 
relation at high redshift (Mao et al. 2011a). 

• To explore radio properties of clusters de- 
tected at other wavelengths. It will be im- 
portant to compare the properties of radio- 
selected clusters from the unbiased EMU sur- 
vey to those of the X-ray selected population 
from surveys such as the eROSITA all-sky 
X-ray survey (Predehl et al. 2010), and the 
Sunyaev-Zeldovich surveys made with the South 
Pole Telescope (Williamson et al. 2011), At- 
acama Cosmology Telescope (Marriage et al. 
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2010) and Planck (Planck Collaboration, 2011) 
surveys. EMU will enable not only statisti- 
cal approaches (e. g. halo occupation distri- 
bution modelling: Peacock fc Smith~| 2000) 
but also directly observed ovcrdcnsities using 
tracers such as WATs ( |Mao et alT]|2010| ). 

• To detect new forms of cluster radio emis- 
sion. For example, high-resolution numeri- 



cal simulations (Battaglia et al. 2009) pre- 



dict an additional network of weak shocks 
throughout the cluster volume which can only 
be seen with sufficient resolution and sensi- 
tivity, but which will be detectable with the 
short spacings of ASKAP. 

Additional cluster-related goals to be addressed 
by EMU include: 

• Do the radio properties of AGNs depend more 
on the properties of their host galaxy or of 
their local environment, such as gas temper 
ature or cooling time (e.g 
20091? 



Mittal et al. 



• Are galaxies of a given mass more likely to 
host a radio source if they are a central or 
satellite galaxy within a halo? 

• Is star formation truncated in halos above a 
given mass by AGN feedback or virial shock 
heating? 

• How does the observed correlation and in- 
dicated feedback cycle between cluster cen- 
tral radio AGNs and the cooling intr acluster 
medium evolve with redshift (e. 
etliri[20T0] )? 



Santos 



emission from cosmological filaments, increasing 
our knowledge of the physical properties of large- 
scale structures. The shortest spacing of ASKAP, 
20m (resulting in sampling of the uv plane down 
to 8m in a joint deconvolution) , ensures that EMU 
will be sensitive to structures as large as 1°, whilst 
even larger scale sizes can be imaged by adding in 
single-dish data. 

In a few cases, the synchrotron emission from 
filaments may be detected directly. One putative 
example is the bridge of radio emission extend- 
ing over the 1 Mpc between the central halo of 
the Coma cluster and the peripheral relic sourc e 
1253+275 ( |Kim et al.|1989l|Kronberg et al.|2007j ). 
Another is the radio emission from ZwCl 2341.1+0000 
which is a linear structure some 6 Mpc in length 
( |Bagchi et aL|2002| [Giovannini et aL]|2010 ). 

Even though EMU will be able to detect only 
the brightest shock structures in the WHIM di- 
rectly, statistical characterisation of an ensemble 



of fainter shock emission is possible (Keshet et al. 
2004 Brown et al.||2010 |. Faint synchrotron emis- 



sion due to WHIM shocks is highly correlated with 
large-scale structure (LSS) on ~+-4 Mpc scales 



flPfrommer et al. 2007 


Ryu et al. 2008 


Skillman et 


al. 2008 


2010 


), and can thus be detected below the 



EMU noise limits through LSS cross-correlation 



((Brown et al. 


2010) 


described in £ 


3.11| 



photometric or spectroscopic redshifts. The sky 
coverage of EMU coupled with its high sensitivity 
to low surface-brightness emission makes it ideal 
for detecting the synchrotron cosmic-web at cos- 
mologically important redshifts 0.1 < z < 1.0 
(Brown et al. 2011b ), corresponding to ^0.1-0.5 Mpc 



on arcmin scales. 



2.8 Cosmic filaments, and the Warm- 
Hot Intergalactic Medium 
(WHIM) 

Approximately half of the Universe's baryons are 
currently "missing" (Cen & Ostriker 19991 and 



are likely to be contained in the elusive Warm- 
Hot Intergalactic Medium (WHIM), where tem- 
peratures of 10 5 — 10 7 K make them extremely 
difficult to detect ( Fraser-McKelvie et al. 2011). 



By subtracting the contribution of compact ra- 
dio sources, studies of low-surface-brightness emis- 
sion with EMU can illuminate the otherwise in- 
visible baryons associated with large scale struc- 
ture. Perhaps the most important diffuse radio 
sources are those illuminating shock structures in 
the WHIM, which occur during infal l onto and 
al ong large-scale-structure filaments (Miniati et 
al~1p00T| [Dolag et al.||2008 ). 



Because of the sensitivity and short spacings 
of ASKAP, EMU will be able to detect faint radio 



2.9 Cosmology and Fundamental Physics 
2.9.1 Background 

The existence of dark energy is one of the most 
profound problems in cosmology. The evidence 
for its presence is indirect: it is implied by the 
supernovae Type la Hubble relation (Riess et al. 
1998 Perlmutter et al.|1999 1 , by the combined in- 



ference of flat geometry (Tt to t = 1) from Cosmic 
Microwave Background (CMB) measurements and 
low mass density (f2 mass 
structure measurements 



~ 0.3) from larg 


;e-scale 


(Cole et al. 


2005 


Eisen- 



stein et al.|2005[), and from galaxy cluster measure- 



ments (Vikhlinin et al. 2009 Mantz et al. 2010 



Allen et al.||2008[ ). 

A further puzzle is the nature of gravity. While 
General Relativity (GR) is consistent with all cur- 
rent observational measurements (e.g. Uzan 2003 ) 



it fails to connect gravity with the other funda- 
mental forces, models for which lead to hypothe- 
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Figure 10 Predicted cross-correlation of the CMB 
(measured by WMAP etc.) with EMU sources, 
assuming a standard ACDM cosmology. The solid 
lines is the theoretical prediction, and the shaded 
region shows cosmic variance errors. 



ses su ch as "brane- worlds" ( Brax & van de Bruck 
2003 ) in which gravity is modified by leakage into 
dimensions other than the brane dimensions. Such 
models predict deviations from GR which have not 
yet been fully tested. 

EMU will be able to perform independent tests 
of models of dark energy and deviations from GR 
using a combination of (a) the auto-correlation of 
radio source positions, (b) the cross-correlation of 
radio sources with the CMB (the late-time Inte- 
grated Sachs Wolfe (ISW) effect), and (c) cross- 
correlation of radio sources with foreground ob- 
jects (cosmic magnification). 

We assume throughout most of this section that 
redshifts of EMU sources are not available, and at 
the end consider the impact if statistical redshifts 
become available. Throughout this section, we as- 
sume a 5 a detection limit, which is expected to 
be rel i able fo r the EMU survey, while |RaccaneTIi] 



et al. (20111 use a conservative 10 o limit, which 



gives rise to slightly different constraints. 
2.9.2 Autocorrelations 

The auto-correlation function (ACF) of source counts, 
also known as the two-point correlation of radio 
sources, measures the degree of clustering of the 
radio sources (se e e.g. Blake & Wall 



canelli et al.||2011 ). Here we consider t 



2002; Rac 



heACF for 

EMU sources that we will measure in two dimen- 
sions on the sky, ignoring any potential availabil- 
ity of redshifts, and making no distinction between 
source types, which have different biases and num- 
ber densities as a function of redshift. 

Despite these limitations, the ACF is of value 
for two purposes. First, cosmological constraints 
from this probe can be combined with those from 
other probes to substantially improve the net cos- 



Figure 11 Predicted constraints from EMU on 
dark energy parameters, assuming no redshifts are 
available. The outer grey ellipse shows the current 
constraints from Type IA supernovae and CMB 
fluctuations. The intermediate ellipse shows the 
constraints from EMU using the ISW effect, pro- 
viding only a modest improvement over existing 
measurements. The inner (dark) ellipse shows the 
constraints from EMU using all effects, including 
cosmic magnification, and shows a significant im- 
provement over existing measurements. In par- 
ticular, EMU will resolve the difference between 
the current best measurement (star) and the value 
(cross) predicted for a standard ACDM cosmology 
with non-evolving dark energy. 



mological constraint, as shown by the combined 
constraints in Figures [TTJ and [12] Second, the 
behaviour of the ACF on large scales constrains 
non-Gaussianity (i.e. the extent to which density 
fluctuations in the early universe were distributed 
with non-Gaussian noise) hence providing a win- 
dow on early-Universe physics. Non-Gaussianity 
indicates a positive skewness of the matter density 
probability distribution, which would lead to an 
increased bias for large-scale halos (see e.g. Dalai 



et al. ||2008[ ). EMU will be able to detect non- 
Gaussianity parameterize d by the fjqi, paramete r 
at the level Jnl > 8, ( |Raccanelli et al" 2011 



Such a detection would be difficult to reconci 
with a simple, single scalar field inflation model 
for the early Universe. 

2.9.3 The Integrated Sachs- Wolfe Effect 

The ISW effect can be understood as follows. Trav- 
elling from the last scattering surface to us, CMB 
photons pass through gravitational potential wells 
of large structures such as superclusters that lie 
along our line of sight. In an Einstein-de Sitter 
universe, the blueshift of a photon falling into a 
well is cancelled by the redshift as it climbs out. 
However, in a universe with dark energy or mod- 
ified GR, the local gravitational potential varies 
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Figure 12 Forecast of constraints for modified grav- 
ity parameters, with symbols the same as for Fig. 



11 The largest ellipse shows constraints from cur- 
rent measurements, and the smallest ellipse shows 
the constraints from EMU using all effects, but 
assuming no redshifts are available. EMU will re- 
solve the difference between the current best mea- 
surement (star) and the value (cross) for standard 
General Relativity, either confirming or ruling out 
some alternative models. 



with time, stretching the potential well while pho- 
tons are traversing it, so that the blueshift on en- 
try is not fully cancelled by the redshift on exit. 
Thus the net photon energy is increased, produc- 
ing CMB temperature anisotropics which make the 
CMB appear slightly hotter in the direction of su- 
perclusters. 

The effect is very weak, but can be detected by 
cross-correlating CMB ma ps with tracers o f large- 
scale structure ( Crittenden & Turok 1996[ ) such as 
radio sources. The detection of the effect has been 
(e-g 



repeat edly confirmed (e.g. Giannantonio et al. 
2008a and references therein); for instance, SDSS 
galaxies in the Sloan Digital Sky Survey (SDSS 



York et al 



2000 1 have been cr oss-correlated with 
the CMB ( (Granett et al.|2008[ ) to achie ve a 4-g re- 
sult. I n the radio, cross-correlation by |Raccanelli| 



et al.| (|2008| of the NVSS radio galaxies with the 
CMB anisotropies also gave a tentative detection 
of the ISW effect. The NVSS detection is partly 
limited by shot noise, and so the far greater num- 
ber of galaxies in EMU should achieve a robust 
measurement of the effect. 

EMU will also be able to constrain the redshift 
evolution of the equation of state of dark energy, 
which would allow us to distinguish between differ- 
ent models of dark energy, such as a cosm ological 
constant ( Tegmark 20041, quintessence dZiatev 



et al. 1999), early dark energy (Xia et al.p009), 



or Unified Dark Matter models (Bertacca et al 



2011). 

The ISW effect is also sensitive to any modifi- 
cations of gravity, allowing sensitive constraints on 



gravity, and can also be used to test models for the 
evolution of the clustering and bias of radio sources 
( Raccanelli et al.[20 08), and to test models for the 
cosmological evolution of radio sources ( Massardi 



et al. 2010). The predicted cross-correlation func- 
tion for EMU sources is shown in FigjlO] EMU will 
provide sensitive constraints on any modifications 
of gravity. 

2.9.4 Cosmic Magnification 

Large-scale structures along the line of sight to 
a distant source introduce distortions in the im- 
ages of high-redshift sources as a result of gravita- 
tional lensing. The distortions by foreground (low- 
redshift) galaxies increase the apparent area occu- 
pied by background (high-redshift) galaxies, thus 
reducing the observed number density at a given 
flux density. On the other hand, the magnifica- 
tion increases the total flux density of unresolved 
high-redshift sources, thus increasing the observed 
number density at a given flux density. The size of 
the combined change in apparent number density 
due to these two opposing effects is sensitive to the 
assumed cosmological geometry and parameters of 
GR. 

The combined effect, known as cosmic magnifi- 
cation, can be tested by cross-correlating number 
densities of low-redshift sources (e.g. selected from 
the optical surveys or from EMU star-forming galax- 
ies) with number densities of high-redshift sources 
(selected from EMU) . The effect wa s first unam- 
biguously detected by Scranton et al. ( 2005 ) using 



SDSS foreground galaxies and quasars. The large 
database of EMU sources will develop this into a 
powerful technique for testing cosmological mod- 
els. 

2.9.5 Observational tests with EMU 

Deviations from GR or dark energy physics will 
influence the auto-correlation, the ISW and cosmic 
magnification (Zhao et al. 2010). By modelling 



t he E M U sour ce distribution and bias, |Raccanelh 
al. ( 2011[ ) have shown that significant limits 
can be placed on cosmological parameters (such 
as w and w') that describe dark energy, and on 
parameters rj and \i that describe modifications to 
GR flPogosian et al.||2010[ ). GR predicts that r] = 
/i = l. Figs 11 and |12| show that, even without 
redshifts, EMU will place significant constraints 
on both Dark Energy and Modified GR. 

Once shot noise is fully overcome, ISW mea- 
surements are limited only by cosmic variance. Sub- 
stantial improvements over current measurements 
can therefore be achieved by using a bigger sur- 
vey area. Combining the EMU survey with the 
WO DAN survey will allow us to make a radio mea- 
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surement of the ISW effect over the entire sky for 
the first time, perhaps leading to the highest sig- 
nificance ISW measurement yet. 

2.9.6 Tomographic Analysis 

The discussion of cosmological tests above makes 
the conservative assumption that no redshifts are 
available for EMU sources. If statistical redshifts 
(Sj3.ll I are available, the radio sources can be split 



into rcdshift bins for measuring the ISW effect or 
cosmic magnification, enabling a "tomographic" 
analysis of auto- and cross-correlation data, lead- 
ing to even more significant constraints than those 
discussed above. For example, strong polarised 
sources are known to have a high median redshift, 
even when they are undetected in optical surveys. 

If there are enough sources in each bin to achieve 
a statistically significant ISW detection, EMU will 
enable a measurement of the redshift evolution of 
the ISW effect and so better constrain cosmologi- 
cal models. The results at low redshifts will give an 
independent confirmation of detections achieved at 
optical wavelengths. However, the radio sources 
extend to far higher redshifts, and so test the ISW 
at epochs that cannot be reached at other wave- 
lengths: a detection of the ISW at high redshift 
would be a clear signature of a non ACDM +GR 
e volut ion ( Raccanelli et al.||2011 ) . 



2.10 Galactic Science 

The EMU survey will include the Galactic Plane, 
thus creating a sensitive wide-field atlas of Galac- 
tic continuum emission, which can address several 
science goals including: 

• A complete census of the early stages of mas- 
sive star formation in the Southern Galactic 
Plane, 

• Understanding the complex structures of gi- 
ant HII regions and the inter-relationship of 
dust, ionised gas and triggered star forma- 
tion, 

• Detection of the youngest and most compact 
supernova remnants to the edge of the Galac- 
tic disk, some of which may have exploded 
within the past century, 

• Detection of supernova remnants, especially 
those detected by eRosita but which are un- 
detected by previous radio surveys, 

• Detection of planetary nebulae, which are 
the most abundant compact Galactic sources 
in the NVSS, and can be useful tools for mea- 
suring extinction, and estimating the star- 
formation rate of stars too small to make 



SNe or HII regions ( 


Condon & Kaplan|1998 


Condon et al. ||1999 


1, 



• Detection of radio stars and pulsars, 

• Serendipitous discoveries, such as the radio 
flares from ultra-cool dwarfs found by |Berger| 



et al. (20011. 



In providing a sensitive, high-resolution con- 
tinuum image of the Galactic Plane, EMU will 
complement the GASK AP HI Galactic HI survey 
(Dickey et al. 20101. Existing interferometric 
radio continuum surveys of the Galactic Plane are 
either at high angular resolution but over a limited 
survey area, or cover a wide area at low angular 
resolution. For example, the MAGPIS (Helfand 
eTaTI [20061 and CORNISH surveys ( |Purcell fc 
Hoare 2010) cover an area ^100 deg 2 at an an- 



gular resolution of 1-6 arcsec, while the Interna- 
tional Galactic Plane Survey consists of a num- 
ber of studies over several hundred deg 2 at a typ- 



ical resolution of ~1 arcmin ( 


McClure-Griffiths 


et al.||2005| |Taylor et al.||2003 


|Stil et al. ||2006l 


Haverkorn et al. 


2006 


). EMU, with its full sky 



coverage, high sensitivity, and ~10 arcsec angular 
resolution, will bridge the gap between these two 
types of survey to reveal newly discovered popu- 
lations of compact HII regions, planetary nebulae 
and young supernova remnants. When combined 
with the WODAN Northern Hemisphere survey 
(see 



1.5) 



EMU will provide a complete census 
of centimetre-wave emission in the Galaxy. 

At centimetre wavelengths the primary mecha- 
nisms for emission from Galactic objects are free- 
free emission from HII regions and planetary neb- 
ulae, synchrotron radiation emitted by supernova 
remnants, and diffuse synchrotron emission emit- 
ted by relativistic cosmic-ray electrons accelerated 
by SNRs. The known radio populations of each 
of these types of object are limited by a combi- 
nation of issues including the limited area covered 
by existing surveys, frequency-dependent selection 
bias (in the case of optically thick HII regions), or 
biases against large scale structure introduced by 
limited uv coverage snapshot surveys. 

Thermal emission from HII regions may be sep- 
arated from non-thermal synchrotron by using the 
correlation between thermal free-free and infrared 



emission (e.g. Helfand et al. 2006 


Thompson et al. 


2006 




Conti & Crowther 2004). 


The radio spec- 



tral index from EMU, and the polarisation data 
from POSSUM, will also be excellent discriminants 
between thermal and non-thermal emission. The 
resolution of EMU is particularly well-m atched to 
MIPSGAL 24 urn ([Carey et al.|[2009|) and Hi- 



G AL 70 /im ( Mol inari et al.|20T0 ) survey images 



of which the latter is an effective tracer of the in- 



tensity of the exciting radiation field ( Compiegne 
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etalpOlol ). 

2.10.1 Star Formation 

EMU's sensitivity allows access to all stages of the 
evolution of a compact HII region (hyper-compact, 
ultracompact and compact), even though the 1.4 
GHz continuum emission will be optically thick, 
and EMU's 10 arcsec resolution is insufficient to 
resolve the ultracompact HII regions. 

Ultracompact HII (UCHII) regions represent 
a young (~10 5 years ol d) phase in the d evelop- 
ment of an HII region ( Churchwell 2002 1 . Most 



UCHII regions have been dis covered by snapshot 
VLA surveys at 5 GHz (e.g. Wood & Churchwell 
l[l989l |Kurtz et al. I [19941 |Walsh et al.||1998[ |Pur- 



cell & Hoare 2010 Urquhart et al. 20091. How 



ever, the snapshot surveys missed an entirely new 
class of optically thick HII region known as hyper- 
compact HII (HCHII) regions ([Kurtz ||2005[) and 
missed a diffuse component ( Kurtz et al. |1999 



|Longmore et al. 2009) in which many UCHII re- 
gions are embedded. 

To build a complete census of such objects re- 
quires a radio survey that is sensitive to both faint 
objects and also structure on scales of a few tens of 
arcsec. Fig. [13] shows the typical spectral energy 
distribution expected from three types of HII re- 
gion (compact, ultracompact and hypercompact) 
located at a distance of 18 kpc. In the optically 
thick regime the spectral index of free-free emis- 
sion is proportional to v 2 , falling to i/~ 01 in the 



optically thin regime (Mezger & Henderson 1967 ) 



For compact HII regions this turnover occurs at 
~1.5 GHz, whereas for the denser and more opti- 
cally thick UCHII and HCHII regions the turnover 
frequency is shifted to higher values. This high- 
frequency turnover causes HCHII regions to be es- 
sentially undetectable with snapshot 5 GHz sur- 
veys (see Fig. 13). Even though HCHII regions 



are extremely optically thick at 1.4 GHz, the sen- 
sitivity of EMU is sufficient to detect them at dis- 
tances up to 18 kpc, except in regions limited by 
confusing strong sources. Consequently, EMU will 
be able to detect all types of HII region over most 
of the Galaxy, although higher resolution follow- 
up observations with other arrays may be needed 
to separate closely associated ultracompact and 
hypercompact HII regions. Many of these obser- 
vations will be made as part of higher frequency 
surveys like CORNISH flPurcell et al. |2008| >, and 
MeerGAL ( |Thompson et al.|201ip . Together with 
these surveys, EMU will be able to determine the 
turnover frequencies (and hence electron densities) 
of these regions. 
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Figure 13 Spectral energy distribution of HII re- 
gions over the frequency ranges of the EMU & 
CORNISH surveys. The solid, dashed and dot- 
dashed lines represent the brightnesses of spherical 
isothermal and homogeneous hypercompact (HC), 
ultracompact (UC) and compact (C) HII regions 
at an assumed distance of 18 kpc. The red band 
shows the frequency range of the 5 GHz CORNISH 
survey and the blue band the range of the 1.3 GHz 
EMU survey. 5 cr detection limits are indicated by 
horizontal lines. 



2.10.2 Supernova Remnants 

Only ~275 supernova remnants have so far been 
identified within the Milky Way ( |Green |[2009[ ), 
out of an estimated tot al population of bet ween 
500 and 10 00 remnants (|Helfand et al.|2006[ ). Al- 
though the Green ( 2009 ) catalogue represents the 



result of more than ~ 50 years of intensive searches 
with the world largest radio telescopes, it is con- 
sidered to be incomplete and strongly biased by 
selection effects, such as the lack of compact (and 
hence young) and faint remnants, both of which 
should be addressed by the high sensitivity and 
angular resolution of EMU. There also exist strong 
synergies between EMU and the high-energy ob- 
servatories HESS, Fermi & Chandra, which may 
be used to confirm EMU non-thermal candidates 
as pulsar wind nebulae. 

The increased surface-brightness sensitivity and 
higher angular resolution of EMU, combined with 
X-ray data from eROSITA ||Predehl et al.||2010|) 



should allow the identification of more than 200 
faint and diffuse X-ray sources which are currently 



classified as supernova remnant candidates ( Beckei 
eraLl[2009[ ). 
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2.10.3 Stellar radio emission 

In the last few years, the improvement of observa- 
tional capabilities has led to the discovery of ra- 
dio emission in a broad variety of stellar objects 
at diverse stages of their evolution, as shown in 
Fig. [l4j In many cases, radio observations have 
revealed astrophysical phenomena not detectable 
by other methods flGiidel |[2002| . 
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Figure 14 H-R diag ra m of all known radio stars, 

Radio stars can be 



taken from Giidel 



(2002) 



found in almost every segment of the stellar H-R 
diagram. 



Radio stars have so far been detected only by 
targeted observations directed at small samples of 
stars thought likely to be radio emitters. Existing 
observations suffer from limited sensitivity (e.g. no 
radio stars have been detected at the radio lumi- 
nosity of the quiescent Sun, L rac iio ~ 10 4 WHz -1 ), 
and suffer from a strong selection bias in that ob- 
servations have been targeted to study a particular 
aspect of stellar radio emission. Consequently, it 
is difficult to forecast the impact of EMU on the 
field of stellar radio astronomy. 

Much of our knowledge of radio stars comes 
from the study of active stars and binary systems 



(Slee et al. 


1987 


Drake et al. 


1989 


Umana et al. 


1993, 


1995 


Seaquist et al.|1993; Seaquist & Ivison 


1994 


Umana et al. 


1998 


Budding et al. 


1999 


)• 



ated with mass-loss and magnetic phenomena and 
is often highly variable dGiidel |2002[ |Dulk ||1985 



White|2004 1 . Non-thermal radio emission can also 



originate from shocks of colliding winds in massive 
binaries ( |Dougherty fc Williams||2000[ ) . 

Transient events are also observed as narrow- 
band, rapid, intense and highly polarised (up to 



100 %) radio bursts in stellar objects that have a 
strong (and often variable) magnetic field and a 
source of energe tic particles, incl uding RS CVns 
and flare stars (Slee et al. 2008), Brown dwarfs 
( Hall inan et al.|2008[), and chemically pecul iar stars 
|Trigilio et al.|2000||2008)|Ravi et al. |2010[ ). These 



radio flares have generally been interpreted as a 
result of coherent emission mechanisms, includ- 
ing electron cyclotron maser emission and plasma 
emission. This emission has been detected in only 
a few tens of stars, partly because of the limited 
sensitivity of available instruments, but mainly due 
to the absence of a deep all-sky radio survey. 

These imperfect statistics suggest that EMU 
will provide an unbiased sample of several thou- 
sand stellar radio sources, enabling a detailed in- 
vestigation into the physics of radio stars. This 
will include stellar physical parameters, magnetic 
activity, the fraction of active single stars and bi- 
nary objects that show coherent emission, the time- 
scales of its variability and their relationship to 
stellar parameters. These are important diagnos- 
tics for studies of stellar magnetospheres, and can 
probe fundamental parameters such as magnetic 
field intensity and topology, and electron energy 
distribution. 

However, it will be essential to distinguish ra- 
dio stars from extragalactic sources using prop- 
erties such as radio polarisation, SEDs, and the 
proper motions available from Hipparcos, TYCHO 
2 and UCAC3 ( |Helfand et al.|1999||Kimball et al. 
|2009"1 ). 



2.10.4 Pulsars 

The number of pulsars at high Galactic latitude is 
poorly known, and Cameron et al. (2011) have 



found that none of the IFRS detected in the AT- 
LAS survey (Norris et al. 2006 Middelberg et 



al. 2008a) are pulsars, implying that the surface 
density of pulsars with a continuum flux level > 
150/xJy at high Galactic latitude is <1 per 7deg 2 . 
On the other hand, G. Hobbs (private communi- 
cation) has estimated on the basis of other surveys 
that about one pulsar with a continuum flux level 
> 50/iJy should occur roughly every four deg 2 , in 
which case EMU will detect about 8000 pulsars, 
which exceeds the total number of currently known 
pulsars. However, it will be difficult to distin- 
guish these from other continuum sources. Diag- 
nostics will include (a) lack of an optical/IR iden- 
tification, (b) steep spectrum, (c) polarisation, es- 
pecially circular polarisation (d) sidelobes caused 
by variability and scintillation during the observa- 
tion. Candidates satisfying these criteria will will 
be search ed for pulsar em ission using the ASKAP 
COAST ( |Ball et al.||2009[ ) project or conventional 
single-dish pulsar searches. 
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2.11 Unexpected outcomes 

Experience has shown that whenever the sky is ob- 
served to a significantly greater sensitivity, or a sig- 
nificantly new volume of observational phase space 
is explored, new discoveries are made. Even AT- 
LAS, which expanded the phase space of wide-deep 
radio surveys by only a factor of a few, identified a 
prev iously unrecognised class of object (IFRS: see 
f 2.4 1 . Because EMU will be much more sensitive 



than any previous large-scale radio survey, it is 
likely to discover new types of object, or new phe- 
nomena. Although it is impossible to predict their 
nature, we might reasonably expect new classes of 
galaxy, or perhaps even stumble across new Galac- 
tic populations. Furthermore, the large EMU dataset, 
covering large areas of sky, will be able to identify 
extremely rare objects. 

Historically, discoveries of new classes of ob- 
jects occur when an open-minded researcher, in- 
timate with the telescope and with their science, 



2010 


Bell-Burnell 


2009 


different: people wi 


1 ma 



recognises something od d in their data (Ekers 

EMU is unlikely to be 
ce unexpected discoveries 
while carefully using EMU data to test hypothe- 
ses. It is therefore arguable that this process needs 
no planning: any new class of object in the data 
will eventually be discovered anyway. On the other 
hand, given the large volume of data, it is possi- 
ble that a class of objects will lie undiscovered for 
decades because it didn't happen to fall within the 
selection criteria of any astronomer. 

It is therefore important for EMU to plan to 
identify new classes or phenomena, rather than 
hoping to stumble across them. EMU is there- 
fore taking the novel approach of developing data- 
mining techniques to identify those objects that 
don't fall into one of the known categories of as- 
tronomical object. A data-mining project (named 
Widefield ouTlier Finder, or WTF) is being estab- 
lished that will attempt to assign each object in the 
EMU catalogue to a known class of astronomical 
object, using the available cross-identifications to 
compare colours, luminosities, and any other avail- 
able data. It will then identify those that are out- 
liers or which depart systematically from known 
examples. Most such outliers will simply repre- 
sent bad data, and are thus valuable in their own 
right for debugging ASKAP, while a few may be 
exciting new discoveries. 



2.12 Legacy value 

The largest existing radio survey, shown in Fig. 
1, is the NRA O VLA Sky Survey (N VSS), whose 



will be 45 times more sensitive, 4.5 times higher 
in resolution, with higher sensitivity to extended 
structures. As a result, EMU will detect ~ 38 
times as many sources as NVSS. More importantly, 
the greater sensitivity means that EMU breaks 
into a different regime. While most NVSS sources 
are radio-loud AGNs, EMU will provide estimates 
of star formation rate and radio-mode accretion 
activity in the galaxies currently being studied by 
mainstream astronomers at all other wavelengths. 

The legacy value of large radio surveys depends 
on the ability to obtain a radio image of any ob- 
ject being studied at other wavelengths. Thus the 
legacy value depends critically on the area of sky 
covered: a survey covering the entire sky has more 
than simply twice the scientific value of a survey 
covering half the sky. For example, the number 
of powerful (FRII-type) radio sources predicted at 
the epoch of reionisation is only ^100 over the en- 
tire sky. The ability to identify radio counterparts 
(or upper limits) for any future observation (for 
example the highest redshift gamma-ray burst, su- 
pernova host galaxy, etc.) is essential. Serendip- 
itous discoveries, too, are most likely to be max- 
imised by surveying the greatest possible area. As 
a result, EMU extends as far North as +30° dec- 



release paper (Condon et al. 1998) is one of the 



most-cited papers in astronomy. EMU will cover 
the same fraction (~ 75%) of the sky as NVSS, but 



lination, and, together with WODAN (see (1.5), 
will cover the entire sky. The combined EMU and 
WODAN catalogues are expected to become the 
primary radio source catalogue for all astronomers, 
and will not be superseded until after the SKA be- 
gins operation. 

3 EMU Design Study 

3.1 The need for a Design Study 

In the two years prior to commencement of the 
EMU Survey, the EMU Design Study is addressing 
the following challenges: 

• Developing an optimum observing strategy, 
while taking into account the needs of poten- 
tially commensal surveys, 

• Working with the ASKAP computing group 
to ensure an optimum processing pipeline, 
and an optimum source extraction algorithm, 
including extraction of extended sources, 

• Addressing technical issues of dynamic range, 
confusion, etc., 

• Developing an optical/IR identification pipeline, 
including a citizen science project, 

• Exploring the use of Statistical Redshifts (see 
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• Developing the data access process, and iden- 
tifying potential data issues, 

• Developing "value-added" projects to max- 
imise the science return from EMU, 

• Refining the EMU Science Goals, to ensure 
that the EMU survey is optimised to address 
them, and planning early science papers. 

The EMU Design Study takes place through a 
number of working groups who each take responsi- 
bility for a number of goals, milestones, and deliv- 
erables. The overall Design Study goal is to have 
a preliminary processing and analysis pipeline in 
place by November 2011, with a fully- functioning 
pipeline in place by November 2012. 

3.2 The role of ATLAS in the EMU 
Design Study 

An important component of the EMU Design Study 
is the ATLAS project (|Norris et al.|2006| |Middel 



berg et al.|[2008a| |Hales et al.||2011[ |Banfield et 



al. 2011 ) , which has a sensitivity, resolution and 
science goals similar to EMU, but over a much 
smaller area of sky (7deg 2 surrounding the CDFS 
and ELAIS-S1 fields). The ATLAS survey, to- 



gether with earlier observations o f the HDFS ( Nor- 
ris et al.|2005[|Huynh et al.|2005 ), is being used as 



a test-bed for many of the techniques being devel- 
oped for EMU, as well as guiding the development 
of the science goals and survey strategy. 

For example, the prototype EMU source ex- 
traction and identification pipeline will be used for 
the final ATLAS data release in late 2011. During 
the course of the ATLAS project, dynamic ranges 
approaching ~ 10 5 have been reached, which are 
amongst the highest achieved in radio astronomy, 
and imaging artefacts have been encountered which 
have been rectified in some specific cases, and a so- 
lution for the general case is being tackled. Such 
imaging problems must be solved not only for EMU, 
but for other SKA Pathfinder projects. 

3.3 Science Data Processing 

The data from the correlator will be reduced in 



an automated processing pipeline ( Cornwell et al 



2011 ), which includes the following steps for EMU: 



• Flag for known radio frequency interference, 
working from a database of known RFI sources, 

• Identify unknown radio frequency interfer- 
ence, saving candidate identification, and iden- 
tify and flag further bad data, 

• Solve for calibration parameters (i.e. frequency- 
dependent complex gains) using least squares 



fits of the predicted visibility (from previ- 
ously obtained model of the sky) to the ob- 
served visibility, 

• Apply calibration parameters, predicting for- 
ward from the last previous solution, 

• Average visibility data to required temporal 
and spectral resolution, 

• Construct an image by (a) gridding the data 
using convolutional resampling, (b) Fourier 
transforming to the image plane, and (c) De- 
convolving the point spread function, 

• Find sources in the resulting image or cube, 

• Save science data products to the ASKAP 
Science Data Archive. 

The measured flux densities from most radio 
surveys have a typical accuracy of 10%, although 
the calibration accuracy should, in principle, en- 
able flux densities to be measured to an accuracy 
of ~ 1%. This latter figure has been adopted as 
the target for EMU, and the EMU Design Study 
will investigate the reason that most surveys fail 
to achieve this figure. A number of instrumen- 
tal effects and completeness corrections have been 
identified ( Hales et al.|2011[ ), and may need to be 
applied to the data, including the following. 

• The primary beam model must be accurate 
to ~ 1% so that the data can be corrected for 
the primary beam response to this accuracy. 
In the case of ASKAP, this will vary across 
the PAF, and so needs to be determined for 
each of the 36 beams. 

• Position-dependent bandwidth smearing (chro- 
matic aberration) over the mosaiced image 
due to the finite bandwidth of frequency cha n- 
nels 



( [Condon et al.||1998| |Ibar et al.||2009| ) 



Surprisingly, mosaiced images suffer more from 
bandwidth smearing than pointed observa- 
tions, since any location in the image, even 
at the centre of a pointing, may include many 
contributions from adjacent pointings in which 
bandwidth smearing is significant. Thus, even 
for tight angular spacing between pointings 
in a mosaic, bandwidth smearing will always 
be non-zero at any location over the final mo- 



saiced image. Hales et al. (2011 ) found that 



bandwidth smearing in central regions of the 
ATLAS mosaiced images, which used 8 MHz 
channels, typically caused a 10% decrement 
in peak flux due to this overlapping effect. 
Even with ASKAP 's 1 MHz channels, this 
correction will be necessary if 1% accuracy 
is to be achieved. 
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• Position-dependent time-average smearing over 
the mosaiced image due to finite integration 



time and Earth rotation (Bridle & Schwab 



the field to high accuracy, and keep it as uni- 
form as possible. 



1999}. This is thought not to be a signifi- 3<4 Simulations and Imaging Pipeline 



cant issue for ASKAP, but is included here 
for completeness. 

• Clean bias in measured fluxes, which may be 
mitigated by suitable weighting schemes (to 
control beam sidelobe properties) and clean- 
ing depth. 

In addition, there are observational effects which 
may bias the fluxes of individual sources and/or 
the statistical properties of the sample as a whole, 
as follows. 



The probability of measuring a faint source 
located on a noise peak is higher than the 
probability of measuring a strong source lo- 
cated in a noise trough, because faint sources 
are more numerous ( [Hogg fc TurTIe71[T998l ). 
This results in a bias on all measured source 
fluxes, which decreases with increasing signal- 
to-noisc. When considered in terms of source 



counts this is known as Eddington bi as (|Ed- 
dington ||1913| |Simpson et al.||2006|) 



Hales 



et al. (20111 have shown that Eddington 



bias can also be a sensitive probe of number 
counts below the flux limit of a survey. 

• Resolution bias due to the lack of sensitivity 
to resolved sources, which can be manifested 
in two ways. First, the lack of short baselines 
can limit the maximum angular scale observ- 
able, although this is not likely to be a prob- 
lem for EMU. Second, faint resolved sources 
may have integrated fluxes sufficient to be 
included in the final catalogue, but may be 
missed in the source extraction process be- 
cause their peak fluxes fall below the signal- 
to-noise source detection threshold (e.g. 



Prar 



doni et aL]|2001 1 . This necessitates a resolu 



tion bias correction. 

Sensitivity will generally vary across an im- 
age, so that the area surveyed to any limit is 
a function of that limit. Since source counts 
must be normalised by the area surveyed, a 
completeness correction must be applied to 
account for the position-dependent sensitiv- 
ity t o faint sources acro ss the mosaiced image 
(e.g. Bondi et al.|2003 ;), which must also take 
into account the position-dependent time-averag 



The goals of the simulations and imaging pipeline 
working group are to: 

• Develop a realistic extragalactic simulated 
sky, 

• Develop a realistic Galactic simulated sky, 

• Work with ASKAP engineers to choose opti- 
mum PAF configuration and weighting scheme, 

• Ensure ASKAP imaging processes are robust 
to Galactic-Plane observations, 

• Optimise imaging algorithms and parame- 
ters, 

• Ensure that EMU data will reach a dynamic 
range of 10 5 , 

• Ensure that EMU measured flux densities 
are accurate to 1%. 

3.4.1 Simulation Analysis 

Simulations have been performed using the SKA 



Design Study (SKADS) simulated sky (|Wilman et 



al. 2008 1 as initial input. This sky is augmented 



by the addition of extended, diffuse, or complex 
sources, which are not fully represented in the SKADS 
sky. This sky is then used to generate uv data sim- 
ilar to that which will be produced by ASKAP, 
using the antenna, beam and array characteris- 
tics. Thermal noise is added to the visibilities but 
calibration is assumed to be ideal. The data are 
then proc essed using the ASK APsoft processing 
pipeline (Cornwell et al. 2011) which is the pro- 



totype version of the ASKAP processing software. 
Current simulations use all 36 antennas, the full 
300 MHz bandwidth, and 8 hours of on-source time 
with approximately uniform weighting. A typical 



and bandwidth-smearing effects ( Hales et al 



2011). For the same reason, large variations 



in sensitivity across the field are likely to lead 
to uncertainties in source counts, and so it 
is essential to measure the sensitivity across 



observed simulated sky is shown in Fig. 15 Re- 
sults so far suggest that the majority of fields will 
reach the required sensitivity and dynamic range, 
and the rest of this section considers those few re- 
maining fields containing strong sources, such as 
that shown in Fig. |16[ where this may not be the 
case. 

The analysis of the simulations focusses on the 
;e difference image, which is the difference between 
the dirty image and the input model convolved 
with the point spread function (PSF). In regions 
containing no strong sources, the rms noise in the 
difference image is close to the value predicted 
from receiver performance (i.e. 10 /iJy for an EMU 
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Figure 15 Part of the simulated sky as observed 
by the simulated ASKAP telescope and processed 
using ASKAPsoft. (Left) The input model sky. 
(Right) The difference between the observed sky 
image and the input sky showing artefacts caused 
by the observing process and deconvolution errors. 
The intensity scale of both images is increased to 
highlight these errors. 




Figure 16 (Left) The simulated sky around Pictor- 
A as observed by the simulated ASKAP telescope 
and processed using ASKAPsoft. The greyscale 
ranges from -0.01 to 0.03 Jy/beam, with a peak 
of 0.9 Jy/beam. (Right) The difference between 
the observed sky image and the input sky showing 
artefacts. This image is enhanced by a factor of 
ten to show up artefacts, so the greyscale ranges 
from -0.001 to 0.003 Jy/beam. 



image) . However, it is higher in regions containing 
bright or extended sources, because of the follow- 
ing effects. 



• Insufficient sampling in the image domain, 
particularly for b right point sources that ar e 
not pixel-centred iBriggs & Cornwell (1992). 
Simulations suggest that 4 pixels/beam will 
limit the dynamic range to ~ 2000-3000 in 
these regions, and so ~ 5-6 pixels across the 
beam are planned to achieve the required 10 5 
dynamic range.. 



Insufficient scales for the multi-scale CLEAN. 
The three scales used in earlier simulations 
were insufficient to deconvolve the more ex- 
tended sources in the field, and so five scales 
(as used in the simulation results shown in 
Fig. 15 ) will be implemented in the final 



• A large CLEAN threshold. A CLEAN thresh- 
old of 1 m Jy/beam was used in the simu- 
lation, which will leave residual lobes of ~ 
20 /^Jy/beam around sources slightly weaker 
than 1 m Jy/beam. A lower value of CLEAN 
threshold will be used for ASKAP. 



As described in §1.2| ASKAP will generate a 
sky model of sources stronger than ~ 1 mJy, which 
will be subtracted from the observed uv data be- 
fore imaging. As a result, many of the effects de- 
scribed above are not expected to be significant 
in ASKAP data, but simulations are continuing to 
test this hypothesis. 

3.4.2 Future Simulations 

A number of simulations are needed to ensure that 
the pipeline will deliver EMU science: 



• Simulate low surface-brightness large-scale 
structures (e.g. Fig. 



16) and test the ef- 



fectiveness of the pipeline in retaining these 
structures, 



• Ensure that preconditioning (see { 3.4.3 ) can 
recover the source structures required for EMU 
science, 

• Simulate faint (well below the rms) low surface- 
brightness large-scale structures to check that 
they can be recovered using correlation and 
stacking techniques, 

• Test deconvolution effectiveness on sources 
with curved radio spectra. 



3.4.3 Tapering 

Radio synthesis imaging offers a trade-off between 
naturally weighted images (to optimise sensitiv- 
ity) and uniformly weighted (to optimise resolu- 
tion) images. The ASK AP processing softwar e 
will use preconditioning (Cornwell et al. 2011 



rather than traditional weighting, and multi-sca 
clean (Cornwell 2008), which reduces the im 



processing pipelines. 



pact of these effects. Nevertheless, in effect both 
naturally weighted and uniformly weighted data 
may be required, implying two processing pipelines 
for continuum data. Extended structures may re- 
quire an even more heavily tapered weighting. Fu- 
ture simulations will explore potential solutions, 
including the use of Briggs robust weighting that 
may provide an optimal compromise between the 
two. They will also explore the possibility of off- 
line processing to generate more heavily tapered 
images than those produced in the real-time pipeline, 
and to determine the optimal on-line weighting to 
facilitate this with minimum loss of information. 
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3.5 Source Extraction and Measure- 
ment 

Several radio source extraction tools are widely 
used, such as the Miriad /AIPS Gaussian fit ting 
routines IMS AD, SAD and V SAD, Sextractor flBeij tin 
fc Arnouts |[l996l), SFIND ([Hopkins et al.||2002j 



and Duchamp (Whiting 2008^ . There are also 
numerous specific tools, such as HAPPY (a mod- 
ified version of SAD used in the FIRST survey 



(White et al. 19971, a machine- learning back-end 



to VSAD, used to construct the SUMSS catalogue 



flMauch et alpM] ), BDSM (used for the LOFAR 
source-finding, N. Mohan, in prep), and the flood- 
fill algorithm being used in the ATLAS Data Re- 
lease 2 ( |Murphy et al. ||2007| pales et al.||2011[ ). 

However, none of these is fully adequate for 
EMU, particularly (a) in extended sources, (b) 
in the presence of artefacts from bright sources, 
and (c) in images whose signal-to-noise ratio varies 
across the image. It appears that an optimum 
strategy has not yet been constructed, particularly 
for extended sources, and there is still no truly 
general-purpose and robust source- finding tool ap- 
propriate for the application to large-area radio 
surveys. Moreover, there has not been a pub- 
lished analysis that thoroughly and quantitatively 
compares the details of source-finding (from defin- 
ing thresholds, identifying pixels to fit, fitting ap- 
proaches, approaches to multiple-component and 
extended sources, etc.) as implemented in differ- 
ent existing tools. 

Since the size of the EMU survey precludes 
manual intervention in the source extraction pro- 
cess, it is necessary to build efficient source extrac- 
tion algorithms that are robust to varying signal- 
to-noise across the image, sensitive to source scales 
ranging from unresolved through complex and multiple- 
component sources, to extended low-surface-brightness 
structures, and that require no manual interven- 
tion. The Design Study is therefore building on ex- 
isting expertise to identify the optimum approach 
to source- finding, and produce a robust continuum 
source finder for both point sources and extended 
emission. 

The primary goal is to develop an algorithm (or 
suite of algorithms) that will be implemented both 
in the ASKAPsoft data reduction pipeline and also 
in at least one of the standard radio-astronomical 
reduction packages, such as casa, as well as being 
published in a journal paper. The software tool 
or tools will also be made available for use by all 
members of the astronomical community for pro- 
cessing data from other instruments. 

Specific goals are to develop optimum source 
identification and measurement strategies and al- 
gorithms, quantify the effects and limitations of 
confusion, and identify mitigation strategies for 



source-measurement of overlapping or multiple sources. 

The problem is split into two parts, one dealing 
with point sources (including marginally extended 
sources still well- modelled by Gaussian fits), and 
the other with extended and complex source struc- 
tures. These initial approaches will be extended 
to include more sophisticated investigations, con- 
trasting wavelet or other decomposition approaches 
with machine-learning and genetic algorithms, in 
particular for the extended and complex source 
identification case, where information available at 
other wavelengths may also be introduced. 

An important part of this project is to build 
linka ges wi th other survey groups through SPARCS 
(see 



3.13) so that the various efforts underway 



worldwide can be coordinated, taking advantage 
of best practice where established, and combining 
resources where practical. 

3.5.1 Compact Source extraction and mea- 
surement 

The point-source investigation is comparing differ- 
ent background subtraction approaches (e.g., the 
polynomial mesh of SExtractor, and the erosion/dilation 
algorithm of Rudnick ( |2002[ ) ) , different approaches 
to thresholding (e.g. simple "canonical" thresh- 
olds of 7K7, and False Discovery Rate (FDR) meth- 
ods ( |Hopkins et al. 2002 )), and different approaches 
to Gaussian fitting. 

This includes a comparison of existing source- 
finding and measurement tools, to identify their 
advantages and disadvantages, and to determine 
whether improvements can be incorporated into 
the existin g ASKAP source-finder (Duchamp: Wl it- 
ing 2008). For each tool, the completeness (frac- 
tion of artificial sources identified) and reliability 
(fraction of identified sources that are true model 
sources, as opposed to false detections) are mea- 
sured as a function of flux density. 

While all source detection algorithms become 
incomplete and unreliable at low signal to noise, 
detailed comparisons can identify whether particu- 
lar source finders are more reliable at fainter levels 
Furthermore, algorithms with the same complete- 
ness and reliability can produce different source 
catalogues. Understanding how these catalogues 
differ, and how they affect the science of the ASKAP 



pr ojects , is currently being studied (|Hancock et 
aI1 [20TT| ). 



The primary result so far has been to recog- 
nise the importance of accurately measuring the 
background noise level, which may vary across the 
field because of artefacts, diffuse emission, and 
varying sensitivity, and subtracting any varying 
low-surface-brightness emission. Given the size of 
ASKAP images, this background variation often 
appears on scales larger than the filters coded into 
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standard source- finders. Errors on the measured 
flux densities are clearly correlated with the back- 
ground structure, which can be removed using a 
filter such as that shown in Fig. [lTJ However, it 
may be challenging to determine an appropriate 
scale size for the filter that can be used automati- 
cally and robustly in EMU observations. 

The next step is to understand why sources 
are identified by one source-finder but not by oth- 
ers. This is often caused by the way tools group 
pixels into objects, such as objects assigned as ex- 
tended by SExtractor, but measured as multiple 
separate sources by Gaussian fitting tools such as 
VSAD and Sfind. This analysis will then be ex- 
tended to explore those objects that are poorly 
fitted by any of the source-finding tools, to pro- 
duce a list of recommendations for implementation 
in the ASKAP source detection pipeline to achieve 
optimal source-extraction and measurement. 

3.5.2 Extended and Complex Source Ex- 
traction and Measurement 

The investigation into optimising detectability of 
extended structure is exploring a variety of filters 
that maximise the detectability of extended struc- 
ture, such as a spatial scale filter ( Rudnick & Lem- 
|2009[) t o identify extended emission, as 



merman 



shown in Fig. 17 



Figure 17 Left: a portion of the original SST3 con- 
tinuum simulation, including artefacts in the top 
left-hand corner caused by a strong source just out- 
side the field; Right: result of applying a spatial fil- 
ter to remove large-scale structure. Note that the 
diffuse Gaussians in the South are also completely 
removed. Both images have the same scaling, with 
an rms of ~ 15 /iJy/beam. 



Other a pproaches inclu de the use of the Hough 
transform ( Ballard "||1981 1 to measure circular or 
elliptical structures (such as SN remnants, or ra- 
dio relic sources), and using compressed sensing 
imaging technique s to radio interferometric data 
( Wiaux et al. 2009 1. The study focusses on metrics 



seems to hold promise for different kinds of struc- 
ture and measures, and the outputs are now being 
quantified and compared, to identify a suitable im- 
plementation for the ASKAP pipeline. 

3.6 Survey Strategy 

One challenge in planning a large survey like EMU 
is to develop the optimal survey strategy to achieve 
the primary science goals, while taking into ac- 
count the capabilities and limitations of ASKAP 
and the need for the most efficient use of telescope 
time. Prior to commissioning, several key issues 
must be considered. 

3.6.1 Source Density and Confusion 




Condon (2008) has shown that a beamsize of 



10 arcsec is near-optimum for ASKAP in terms 
of the trade-off between resolution and confusion, 
and that the rms sensitivity is increased by ~ 10% 
by confusion noise. This is consistent with pre- 
vious observations (Hopkins et al.||2003[ |Norris 



et al. 2005 Huynh et al. 



2005[ ) which have shown 



that the confusion level at the rms sensitivity (10 
/jjy/beam) and resolution (10 arcsec) of EMU is 
~ 50 synthesised beams per source, so that only a 
small fraction of EMU sources will be affected by 
confusion. Nevertheless, several science goals of 
EMU, in particular those focusing on diffuse emis- 
sion and stacking experiments, rely on solid un- 
derstanding and mitigation of confusion and noise 
variations. 

J Simulations conducted by the ASKAP comput- 
ing team, supplemented by model diffuse sources, 
confirm that confusion is not the limiting factor 
at 10 arcsec, but will be an issue at the 1 arcmin 
resolution ideal for the detection of large-scale dif- 
fuse emission. Compact source subtraction and 
filtering is being explored in order overcome this 
issue. Initial experiments (e.g., Fig. 18) have im- 



to compare techniques, and implementing suitable 
model sources in the simulations. Each technique 



proved our understanding of these issues, and will 
inform the diagnostics needed in both the science 
and imaging simulations. 

3.6.2 Choice of Observing Frequency 

EMU will observe a single contiguous band at 1130- 
1430 MHz, which is a compromise between (a) the 
reduced PAF beam filling factor above 1400 MHz, 
(b) the poorer spatial resolution at lower frequen- 
cies, and (c) the desire to observe commensally 
with the WALLABY project, for which the fre- 
quency range is determined by redshifted hydro- 
gen. On the other hand, other factors (design con- 
straints, choice of commensal observing programs) 
may necessitate revising this choice, and some con- 
tinuum data may also be available as a by-product 
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Figure 18 Top: Original 10-arcsec resolution 
ASKAP simulation with two diffuse radio ha- 
los injected; Centre: Same as above convolved 
to 60 arcsec resolution. The image is confusion 
limited at 40-100 /zJy/beam; Bottom: Same as 
above but with the SKADS source model sub- 
tracted (with errors). This image reaches ther- 
mal noise on small scales, with a typical rms of 
20-50 ^(Jy/beam. 



from other ASKAP projects (e.g. POSSUM, DINGO, 
FLASH) . Therefore the EMU Design Study simu- 
lations are including observing frequency as a free 
parameter. 

3.6.3 Observing Strategy 

To achieve the expected sensitivity in EMU it will 
be necessary either to integrate at each pointing 
for ~12h and then move over to the next pointing, 
or take many short integrations at each pointing. 
The hrst option allows the required depth to be 
reached without requiring any historical data, and 
is robust to variability (on timescales >12h), and 
changes in the dirty beam caused by instrumental 
failures. It will also allow an early identification of 
any instrumentation or systematic issues that may 



restrict our dynamic range. The second option is 
difficult to implement as it requires the processing 
pipeline to have access to historical data, but will 
detect variability, and will provide graceful degra- 
dation to array faults. For example, a hardware 
failure on a particular day may result in a reduc- 
tion of sensitivity over several fields, rather than 
causing a total loss of one field. Option 1 is max- 
imally commensal with WALLABY while option 
2 is maximally commensal with VAST. It is likely 
that a variant of Option 1 will be adopted, aug- 
mented by some dithering or interlacing to achieve 
the necessary sensitivity uniformity (see \ 3.6.6 1. 



3.6.4 Tiling Strategy 

Because of computational overheads, it will be in- 
efficient for the ASKAP pipeline processor to com- 
bine data from overlapping pointings obtained on 
different dates. The loss of observing efficiency 
caused by overlapping pointings is minimised by 
the square feed array that has been adopted for 
ASKAP. Initial tiling experimentation has begun 
with tiling stripes of constant declination, motivat- 
ing a coordinated EMU/ WALLABY move towards 
a unified formalism. 

The trade-offs between tiling strategies and PAF 
weighting schemes are not yet fully understood. 
Weighting schemes may be optimised for (a) max- 
imum field of view, (b) maximum sensitivity, (c) 
uniform sensitivity, or (d) best polarisation response, 
and that decision may influence the tiling strat- 
egy. A further factor is the need to combine (or 
"stitch") images and catalogues across adjacent 
fields, which may also have some impact on tiling 
strategy. 

3.6.5 Dynamic Range 

Dynamic range may be the biggest factor in limit- 
ing the uniformity and completeness of EMU, and 
it drives many of the ASKAP design requirements, 
such as the 3-axis telescopes, the strict pointing re- 
quirements, and the good gain stability of the an- 
tenna front-end subsystems. It also drives much 
of the current EM U design study, such as the sim- 
ulations (see ^3.4). 

Each 30 deg^ field will contain about two sources 
stronger than 1 Jy. The target rms noise is 10 
/^Jy/beam, and so in principle a dynamic range 
of ~ 10 5 across the 30deg 2 FOV is required. Al- 
though the effect of a strong source on adjacent 
beams is currently unclear, the 1 Jy sources will 
not be present in all the sub-beams within the 
30deg 2 . Therefore, even if dynamic range limits 
some sub-beams, 10 /iJy/beam rms should still be 
reached in the majority of sub-beams. Simulated 
ASKAP data is being used to test these effects and 
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others, such as the effect of strong diffuse emission 
from the Galactic Plane. Other issues include the 
optimum weighting of the PAF feeds, the required 
uniformity across the beam, the optimum observ- 
ing strategy, and choice of observing parameters 
such as weighting. So far, simulations (see [3.4 1 



suggest that the dynamic range will not be limited 
by the data processing, but an unambiguous an- 
swer must await the characterisation of the PAFs 
in the BETA observations. 

3.6.6 Sensitivity Uniformity 

Sensitivity can in principle vary significantly across 
the field of view of a single pointing, although this 
will be mitigated by a planned dithering pattern 
that should reduce non- uniformity to below 10%. 
Nevertheless, it is important to estimate what the 
required uniformity is for EMU science goals be- 
fore a final observing/dithering strategy is reached. 
We consider two cases. 

• Galaxies, AGNs, & Star Formation: Com- 
pleteness is a critical component, and science 
goals permit some sensitivity variations pro- 
vided the target 10 /xJy/beam rms is reached 
at the completeness cutoff. However, large 
sensitivity variations result in loss of observ- 
ing time: a 20% gain variation in one spot 
would require the completeness threshold to 
be set 20% higher across the field, and since 
sensitivity oc (observing time) 2 , this will re- 
sult in ~ 40% of the data being discarded. 

• Diffuse emission: The scale size of any sen- 
sitivity variations must be much smaller or 
much larger than the diffuse emission of in- 
terest, otherwise the variations act as con- 
fusion noise. Simulations have shown that 
variations larger than ~20-25 arcmin can be 
effectively removed through spatial scale fil- 
tering. Noise variations on scales between 
2 and 20 arcmin are the most problematic, 
though detailed knowledge and modelling of 
the variations will enable the data to be par- 
tially corrected for such variations before dif- 
fuse source extraction. 

3.7 Commissioning and Science Ver- 
ification 

The EMU team will play an active role in the 
ASKAP science verification tests, which will in- 
clude observations of a well-studied field with ex- 
isting 20cm data with the same sensitivity and 
resolution as EMU. In addition, that field will be 
observed regularly throughout the commissioning 
and operational life of ASKAP. Not only will such 
a process enable checks against images and results 



obtained with other instruments, but it will also 
provide a regular check on stability, and give warn- 
ing of subtle problems that may arise during the 
lifetime of ASKAP. 

These observations will also result in a set of 
radio data which can be co-added to provide an 
extremely deep field down to the confusion limit, 
and so the field should be chosen to maximise the 
availability of multi-wavelength data. Probably 
the best studied field in the sky is that surrounding 
ATLAS-CDFS, which offers: 

• 30 deg 2 which has been mapped with the VLA 
at 20cm as part of NVSS to an rms of 0.45 
mjy/beam, 

• 4 deg 2 which has been mapped with the ATCA 
at 20cm as part of ATLAS, to an rms of 
10/xJy/beam, reproducing the rms, resolu- 
tion, and dynamic range that are closely matched 
to the Design Study goals of ASKAP, includ- 
ing polarisation, spectral index, wide-field 
VLBI, and variability studies, 

• Probably the most intensely studied field in 
the sky at optical IR and X-ray wavelengths, 
resulting in important science from the re- 
peated (and hence deep) ASKAP observa- 
tions, 

• Areas of sky which have been mapped to 
an rms of lO^Jy/beam situated a few de- 
grees away from sources with flux densities 
> 1 Jy, which will be invaluable for checking 
dynamic range. 

T he fi rst observations will take place on BETA 
(see [1.2). Although the instantaneous sensitiv- 
ity and spatial resolution of BETA are modest, 
it will be unmatched for surveying large diffuse 
structures until the advent of the full ASKAP ar- 
ray. It is not yet clear whether BETA will be fully 
committed to engineering tests, or whether some 
science might be achievable with BETA. For the 
latter eventuality, science targets have been cho- 
sen that can yield early science publications from 
ASKAP, including observations of clusters and su- 
perclusters including large diffuse galaxies, relics, 
and low-surface brightness shock structures. 

Once the full ASKAP array is operational, the 
EMU survey will start with a number of stringent 
data quality checks before being released into the 
public domain, and this Quality Control process, 
described in j ]4.1[ is the responsibility of the EMU 
team. In particular, it is planned to observe one 
or more fields that have been deeply surveyed by 
other telescopes, such as the ATLAS and COS- 
MOS fields, to check for consistency of results and 
calibration. It is planned also to survey a large 
Northern field at a declination about +25° which 
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will also have been observed with the EVLA, and 
which will subsequently be observed with North- 
ern surveys such as WO DAN to ensure consistency 
between the surveys. 

3.8 Data Format and Access 

3.8.1 EMU Data Requirements 

An EMU data requirements document has been 
continuously updated since the earliest days of EMU. 
The resulting list of primary data products (termed 
"level 5 products" in ASKAP documentation), is 
summarised in Table [2j However, new require- 
ments continue to arise. For example, a recently 
recognised requirement is to optimise EMU data 
delivery for stacking. To correctly interpret stack- 
ing results requires detailed knowledge of the point- 
spread function at each point of the image, result- 
ing in further data requirements such as (a) storing 
uncleaned residual images without cleaned compo- 
nents being added back in, (b) storing (or being 
able to reconstruct) an accurate representation of 
the dirty beam (which varies significantly across 
the field) at each point in the image. 

In addition to the science-driven data require- 
ments, discussed here, a number of statistical tests 
will be made on the data to check the data quality, 
as described in 94.11 



3.8.2 Value-Added Data 

EMU has an ambitious plan to provide source asso- 
ciations and identifications in near real-time, and 
place them in the public domain. These value- 
added data constitute the "Secondary Data Prod- 
ucts", or, in ASKAP terminology, level 7 data. 
Their generation includes grouping radio compo- 
nents into sources (e.g. classical radio double and 
triple sources etc) and cross-identifying with op- 
tical and infrared surveys. EMU is also collabo- 
rating with the Galaxy Zoo group to extend this 
process into the Citizen Science arena (see ^3.9). 
To achieve this, collaborations have been initiated 
with the appropriate surveys, and discussions are 
taking place on how best to produce these associa- 
tions and cross-identifications, using a probabilis- 
tic approach. In addition, NED (NASA Extra- 
galactic Database) have agreed to accept a near- 
real-time automated query for this process, and 
have also agreed to host the resulting source cata- 
logue and associations. 

3.8.3 Virtual Observatory Server 

Management of primary ASKAP data will be the 
responsibility of the ASKAP Science Data Archive 
Facility (ASDAF). However, existing Virtual Ob- 
servatory (VO) standards have not yet been fully 



Table 2 EMU Primary Data Products 
UVDATA 

uv data will be available in all four Stokes 

parameters, and in all 300 frequency channels. 

IMAGES (in IQUV) 

An intensity image, 

A coverage and sensitivity map 

An uncleaned residual image a , 

Images of spectral index and spectral curvature, 

Point-spread functions across the image 6 , 

Detailed metadata for all the above 

CATALOGUE OF COMPONENTS c 

A unique IAU-registered component name, 

Ra, Dec (J2000) of the peak, 

A peak and integrated flux density, 

The spectral index of the peak and integrated flux, 

The parameters of a fitted Gaussian, 

For failed Gaussian fits, an estimate of the extent 

of the component, 
A postage-stamp image, 
A history of changes to the catalogue entry. 



a Thc residual image after cleaning, in which 
cleaned components, and sources in the ASKAP 
sky model, have not been added back in. This im- 
age will be valuable for stacking and for detecting 
diffuse emission. 

b The point spread function of the uncleaned emis- 
sion, which will vary across the FOV. It is im- 
portant to know this point spread function at any 
point to c alculate the statistics of stacked images 
(see plO I. 



c All measured quantities will be accompanied by 
an estimate of their standard error. 



implemented on a large radio survey, and chal- 
lenges will probably be encountered when we do 
so. To identify these, an internal prototype VO 
server and associated MySQL Database is being 
developed for ATLAS data and is now in regu- 
lar use by ATLAS team members. It is expected 
that further VO development may include mod- 
ifications to VO protocols, or a requirement for 
additional VO Tools. The EMU Design Study in- 
cludes plans to place ATLAS cross-identifications 
and other data (e.g. spectroscopy) on the ATLAS 
VO server, as a step to identifying the EMU VO 
requirements. 

3.9 Source Characterisation and Cross- 
Identification 

The EMU component catalogue will be cross-matched 
in a near-real-time automated pipeline with avail- 
able large-area optical/IR surveys shown in Table 
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[3j For each survey, the quoted sensitivity has been 
used to estimate the fraction of radio sources in 
EMU with S20cm > 50^Jy that will be detected 
by that survey, based on the measured flux densi- 
ties of sources in COSM OS ( |Scoville et al~1|2007 
Schinnerer et al. ||2007[ ). 
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Table 3 Key Multi- Wavelength surveys with which EMU data will be cross-identified (restricted to surveys 
larger than 1000 deg 2 ). All magnitudes in AB. The EMU detected column is the fraction of EMU sources 
that arc detectable by the multi- wavelength survey to its 5 — a limit. The survey matched column is 
the fraction of EMU sources in the previous column which are in the area of sky covered by the multi- 
wavelength survey. The sensitivity shown for the WISE survey is for the 3.5/mi band. 



Survey 


Area 


Wavelength 


Limiting 


EMU 


Survey 


Data 


Name 


(deg 2 ) 


Bands 


Mag. 


Detected 


Matched 


Release 






or flux a 


(%) 


(%) 


Date 


WISE 1 


40000 


3.4, 4.6, 12, 22 ^m 


80 fxJy 


23 


100 


2012 


Pan-Starrs 2 


30000 


9, r, i, z, y 


r < 24.0 


54 


50 


2020 


Wallaby 3 > b 


30000 


20 cm (HI) 


1.6 mjy c 


1 


100 


2013 


LSST 4 


20000 


u, 9, r, i, z, y 


r < 27.5 


96 


67 


2020 


Skymapper 5 


20000 


u, v, g, r, i, z 


r < 22.6 


31 


66 


2015 


VHS 6 


20000 


Y, J, H, K 


K< 20.5 


49 


66 


2012 


SDSS 7 


12000 


u, g, r, i, z 


r < 22.2 


28 


22 


DR8 2011 


DES 8 


5000 


9, r, i, z, y 


r < 25 


71 


17 


2017 


VST-ATLAS 9 


4500 


u, g, r, i, z 


r < 22.3 


30 


15 


2012? 


Viking 10 


1500 


Y, J, H, K 


K< 21.5 


68 


5 


2012 


Pan-Starrs Deep 2 


1200 


0.5 - 0.8, g, r, i, z, y 


g < 27.0 


57 


4 


2020 



a Dcnotes 5a point source detection. However, in many cases, a priori positional information will enable 
3cr data to be used, resulting in a higher detection rate. 

b Being an HI survey, WALLABY will measure redshifts for all detected galaxies out to z = 0.26. 
c per 4 km s -1 c h annel achieved in 8 hours integration. 
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Experience with surveys such as ATLAS and 
COSMOS indicates that ~ 5% of EMU radio sources 
will consist of multiple components such as radio 
double or triple sources with a core-lobe morphol- 
ogy. A key goal of the EMU Design Study is to 
develop an automated cross-identification pipeline, 
significantly more sophisticated than a simple nearest- 
neighbour algorithm, to group the 70 million ra- 
dio components detected by EMU into "sources", 
and cross-identify them with optical and infrared 
sources found by other surveys. The algorithm 
will be run (in an automated pipeline) on the pri- 
mary components catalogue described in Table [2] 
to produce a secondary data product, the "source" 
catalogue, in which components are grouped to- 
gether to form sources, and arc associated with 
optical/IR identifications. In cases where several 
cross-identifications are possible, each will be given 
in the source catalogue along with an estimated 
probability. 

This Design Study builds on the experience 
developed through a number of international and 
multi-wavelength surveys in which EMU team mem- 
bers are named investigators, and some of whom 
are developing similar tools for Herschel, Spitzer 
and SCUBA/SCUBA-2 cross- i dentifications (e.g. 
Roseboom et~aTJ|2009l |20T0| |Smith et g]|2010 
Chapin et al.||2011[ ). 



Given the expected source density, and red- 
shift distribution, of both radio sources and po- 
tential matching catalogues (see T able 3|), a like- 
lihood ratio matching technique (Sutherland & 



Saunders 


1992 


Smith et al. 


2010) 



should be adequate in the majority of cases, es- 
pecially if source SED/colour information is in- 
corporated (Roseboom et al. 2009; Chapin et al. 
2010). The ~ 5% of radio sources (see e.g. Nor- 



ris et al. 20061 consisting of multiple components 
represent a considerable challenge to automated 
cross-identification algorithms, and our proposed 
algorithm will need to account for these. Some 
potential solutions (i.e. using geometric informa- 
tion in matching, simultaneous radio self matching 
and ancillary data matching) are currently being 
developed, and will be tested as part of the Design 
Study. 

We will use Monte-Carlo simulations to derive 
probable misidentification rates for various classes 
of source, and will publish the algorithm and the 
results of the simulations in a journal paper, as 
well as provide code to the astronomical commu- 
nity. During the early part of the Design Study, 
ATLAS data will be used to investigate possible al- 
gorithms. The later part of the Design Study will 
include setting up collaborations, ensuring access 
to appropriate surveys, constructing software for 
the pipeline processor, and running simulations to 



measure the misidentification rates. 

Applying the limits shown in Table [3] to the 
sources detected in ATLAS and COSMOS, it is 
expected that, of the 70 million sources detected 
and catalogued by EMU, ~ 40% will initially have 
identified counterparts in wide-field optical and 
IR surveys (e.g. SkyMapper, VHS, WISE, LSST, 
etc.) and a further few percent will be identified 
by smaller but deeper surveys such as HERMES 
(Seymour et al. 20111. This fraction will rise to 



70% in 2020 with the release of the LSST data. 

Of those 30 million sources with potential iden- 
tifications initially, we estimate that ~ 25 million 
will yield reliable cross-identifications using a sim- 
ple likelihood ratio algorithm as discussed above, 
and a further 3 million will be handled satisfac- 
torily by a more sophisticated cross-identification 
algorithm which will take account of the spectral 
energy distribution at other wavelengths. For ex- 
ample, three radio components in a line could ei- 
ther be three low-redshift star-forming galaxies, 
or a high-redshift FRII galaxy. Three IR galax- 
ies with a star-forming SED coincident with the 
three radio components would indicate the first 
solution, whereas a bright elliptical component at 
the centre, and no identification with the outer 
components, would indicate the latter solution. 

The remaining 2 million are likely to be ex- 
tended, confused, or complex sources for which 
automated algorithms are unlikely to be able to 
make a confident identification. Ironically, these 
are the sources most likely to yield the most inter- 
esting science, but a human brain will be required 
for an optimum cross-identification, and there are 
simply not enough human brains within the EMU 
team (or within the global community of profes- 
sional astronomers) to look at each galaxy. 

In cases where the automated pipeline fails, 
EMU will use a "Citizen Science" approach to en- 
list the help of thousands of enthusiastic amateurs, 



in collaboration with the Galaxy Zoo group ( Lin- 
tott et aL||2008 ). Galaxy Zoo have used this tech- 



nique very successfully for classification of SDSS 
galaxies, and have agreed to collaborate with EMU 
on creating "Radio Zoo" . An initial prototype Ra- 
dioZoo project will be established in mid 2011 us- 
ing ATLAS data. If resources permit, machine- 
learning cross-identification will also be explored, 
following the approach of |Djorgovski et aL ( 2010 ) 
who have extended the "Citizen Science" approach 
by using it to train neural nets and other algo- 
rithms. 



3.10 Stacking and Data-Intensive Re- 
search 

EMU has an additional requirement for facilities 
equipped for data exploration and mining, and 
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data-intensive research, such as the technique of 
stacking. 

The process of stacking involves identifying a 
class of objects which are generally below the de- 
tection limit of a survey, but then combining (typ- 
ically by taking a censored mean or median) to- 
gether the data at the position where such objects 
are expected in the survey. The noise tends to 
cancel, while any low level of flux density in the 
sources adds, resulting in a detection threshold 
very much lower than that of the unstacked sur- 
vey. Stacking at radio wavelengths has been used 
very successfully (e.g. White||2007 Ivison et al. 



20071 [20T0l|Dunne et al.|2009||Messias et al.|20i0 



Bourne et al.|2011[ ) on high-resolution data for the 



purpose of studying faint populations which are 
below the detection threshold of the radio image, 
and has proven to be a powerful tool for studying 
star formation rates, AGN activity, and measuring 
the fraction of the extragalactic background con- 
tributed by various source populations. For exam- 
ple, the radio-infrared correlation can be probed 
down to levels well below the threshold of any ra- 
dio survey by averaging the radio flux density at 
the position of all galaxies with an infrared flux 
density within a particular bin ( Boyle et al.|2007 ) 



The unprecedented area-depth product of EMU 
makes it remarkably suitable for the stacking pro- 
cess. For example, stacking a sample of a million 
optically selected galaxies in the EMU data will 
result in a noise level of ~10 nJy Because of the 
wide area of EMU, even rare classes of source can 
be stacked, and it should also be possible to cre- 
ate stacked images of extended sources, such as 
clusters. However, the extent to which such deep 
stacking will be successful in EMU will depend 
on the extent to which imaging artefacts are can- 
celled by stacking, and the effectiveness of stack- 
ing EMU data will be simulated, and stacking sci- 
ence projects conducted using ATLAS, NVSS, and 
FIRST, to see whether particular observing strate- 
gies or reduction algorithms are necessary. 

In particular, it has been established that stack- 
ing can be biased by a number of effects such as 
failure to account for the point-spread function, 
particularly in the presence of confusion and in 



highly clustered fields ( 


Bourne et al.||2011 


Greve 


et al.||2010 Penner et a 


L||2011| |Chary et a 


.||2008 


Bethermin et al. 2010|) and so stacking experiments 



and simulations will be conducted during the De- 
sign Study to (a) determine what requirements 
these place upon the EMU data (e.g. storing the 
point-spread function at all positions across the 
primary beam) and (b) to establish stacking algo- 
rithms and software which can be used robustly on 
EMU data. 

Other examples of data-intensive research in- 
clude: 



• Identification of sources which do not fit into 
known categories of radio source, and so are 
likely to be artefacts or exciting new classes 
of source, as discussed in §2.11| 

• Extraction of low surface brightness emission 
cross-correlated with millions of galaxies se- 
lected from other surveys. For example, this 
can be used to detect the synchrotron emis- 
sion from cosmic filaments or sheets. 

• Cross-correlation of low-redshift galaxies with 
high-redshift galaxies, or the CMB, to test 
cosmology and fundamental physics as dis- 



cussed in £ 2.9 



The computational facilities required for such 
data-intensive research are currently unfunded, but 
it is hoped that the computational facilities asso- 
ciated with the ASDAF will become available. 

3.11 Measuring Redshifts 

To interpret data from EMU, redshifts are invalu- 
able. Most of the science goals discussed in §2 as- 
sume that redshifts will not be available for most 
EMU sources, although much of the EMU science 
can be greatly enhanced if redshifts are available. 
However, no existing or planned spectroscopic red- 
shift survey can cover more than a tiny fraction of 
EMU's 70 million sources. 

For nearby galaxies, HI redshifts will be avail- 
able from WALLABY, which will provide ~ 5 x 10 5 
redshifts, and smaller numbers will be provided 
by other redshift surveys such as SDSS (|York et 



al. 20001 and GAMA (Driver et al. 2009). The 



remaining ~ 99% of EMU galaxies will not have 
spectroscopic redshifts. 

Photometric redshifts, in which SEDs of var- 
ious templates are fitted to the measured multi- 
band photometry of target galaxies, are widely 
used as a surrogate for spectroscopic redshifts, with 
considerable success. As discussed below, ~ 30% 
of EMU sources will have multi-wavelength opti- 
cal/IR photometric data at the time of data re- 
lease, increasing to - 70% in 2020. The SED 
of star-forming galaxies has a characteristic shape 
which provides good fits to SED modelling, and 
so it is expected that the majority of EMU star- 
forming galaxies (whose host galaxies will mainly 
be low-redshift and hence optically bright) will 
have reliable photometric redshifts. 

Similarly, low-power AGN (such as Seyfert galax- 
ies) and AGNs hosted by elliptical galaxies at z < 
1 are also likely to yield reliable photometric red- 
shifts. 

High-redshift radio-loud AGNS present more of 
a challenge. Photometric redshifts are notoriously 
bad at fitting the power-law SEDs of AGNs, and 
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hence strong radio-selec t ed sou rces. For example, 
Rowan- Robinson et al. (2008) found a 99% suc- 



cess rate in photometric redshifts of galaxies in the 
SWIRE survey, while |Norris et ah] (|2006|) found 



only a 50% success rate in Rowan-Robinson's fits 
for radio sources within that sample, and a suc- 
cess rate indistinguishable from chance for radio 
sour ces at z>l. On the other hand, Salvato et 
al. (2010) have achieved reliable photometric red- 
shiftsoTAGNs in the COSMOS field by using high- 
quality photometry in a large number of bands, 
and taking account of effects such as source vari- 
ability. 

While the relatively sparse photometry avail- 
able for most EMU sources (listed in Table [3]) can 
not generate such accurate photometric redshifts 
for high-redshift sources, it can provide an esti- 
mate of the probability of a given source being 
at a particular redshift, for about 30% of EMU 
galaxies. Such estimates are valuable for statisti- 



cal purposes such as cosmological tests (see {2.9) 



and we term such redshifts "statistical redshifts 

Even a non-detection can carry useful infor- 
mation, and radio data themselves can add signif- 
icantly to the choice of SED template, and hence 
to a probabilistic estimate of redshift. For exam- 
ple, an AGN template may be indicated by the 
radio morphology, the detection of polarisation, 
a radio spectral index that is significantly differ- 
ent from —0.7, or a low far-infrared-radio ratio. 
Alternatively, high-redshift radio galaxies can be 
identified from their strong radio emission coupled 
with a K-band non-detection ( jWillott et al. 2003 ) . 
Even the radio data alone can provide information 
about the probable redshift range. For example, a 
steep radio spectral index increases the probability 
of a high redshift (De Breuck et al. 2002), while 



the angular size of some classes of galaxy can be 
correlated with redshift (e.g. Wardle & Miley 



1974). 



Radio polarisation data c an also provide sta - 
tistical redshift informat ion (Norris et aL|2011c ). 
The POSSUM project QGaensler et al.||2010[ ) will 
run commensally with EMU to generate a cat- 
alogue of polarised fluxes and Faraday rotation 
measures (or upper limits) for all sources detected 
by EMU. Sources that are strongly polarised are 
nearly always AGNs (Hales et al. 2011), and so 
have a mean z ~ 1.88, while unpolarised sources 
are mainly star-forming galaxies with a mean z ~ 
1.08 (see Fig. [5]). Consequently, cosmological tests 
may be made by treating unpolarised sources as a 
low-redshift screen in front of background high- 
redshift polarised sources. 

A significant fraction of statistical redshifts are 
likely to be incorrect. A correction for this can be 
applied in a statistical study of a sufficient num- 
ber of objects, provided that reliability and com- 



pleteness are carefully calibrated in a small well- 
studied area with accu rate spectroscopic redshifts. 
The ATLAS survey (|Norris et al~1|2006| [MiddeT- 



berg et al. 2008a I is expected to yield 16000 ra- 
dio sources at its completion in 2011 (Banfield 



et al. 2011). Spectroscopy on these sources is al- 



ready underway ( Mao et al. 2011b I and other large 



spectroscopy proposals have also been submitted 
to measure redshifts for the ATLAS radio sources. 
This spectroscopy should enable the training of al- 
gorithms to yield, and correct, statistical redshifts 
for - 30% of EMU sources. 

A further initiative is a proposed project to 
make use of the wide-field spectroscopic capabili- 
ties of the UK Schmidt Telescope at Siding Spring. 
The aim of this project, named TAIPAN (Trans- 
forming Astronomical Imaging-surveys through Poly- 
chromatic Analysis of Nebulae), is to extend the 
depth of the 6dFGS ( Jones et al. 2009 ) to achieve a 



complete spectroscopic survey over the full south- 
ern hemisphere, potentially approaching the depths 
of the SDSS spectroscopic survey. This would pro- 
vide invaluable spectroscopic data to complement 
all wide-field ASKAP surveys, and especially EMU. 

3.12 Imaging the Galactic Plane 

Although EMU focuses primarily on extragalactic 
science, Galactic science is also an important sec- 



ondary goal, as discussed in E 2.10 above. Imaging 
the Galactic Plane, or even imaging extragalactic 
sky near the Galactic Plane, introduces additional 
issues which must be tackled in the Design Study, 
such as: 

• What are the limitations in the depth of EMU 
due to the bright extended sources in the 
Galactic Plane? 

• How can the sensitivity to larger-scale struc- 
tures such as HII regions be maximised? 

• How close to the Galactic Plane can the full 
sensitivity and dynamic range be achieved 
for the main extragalactic EMU survey? 

The Design Study attempts to answer these by 
conducting simulations of the standard SKADS 
simulated sky to which Galactic continuum images 
have been added. 

3.13 SPARCS: SKA Pathfinders Ra- 
dio Continuum Survey Work- 
ing Group 

Several next-generation radio telescopes and up- 
grades are already being built around the world, 
including LOFAR (The Netherlands), ASKAP (Aus- 
tralia), Meerkat (South Africa), EVLA (USA), eMER- 
LIN (UK), the Allen Telescope Array (USA), and 
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Apertif (The Netherlands). Most of these projects 
have multiple science goals, but they have one goal 
in common, which is to survey the radio continuum 
emission from galaxies, in order to understand the 
formation and evolution of galaxies over cosmic 
time, and the cosmological parameters and large- 
scale structures which drive it. In pursuit of this 
goal, the different teams are developing techniques 
such as multi-scale deconvolution, source extrac- 
tion and classification, and multi-wavelength cross- 
identification. Furthermore, these projects share 
specific science goals, which in most cases require 
further definition before a well-planned survey can 
be executed. Finally, each of these new instru- 
ments has different strengths, and coordination of 
surveys between them can help maximise the sci- 
ence from each of them. 

The SKA PAthfinder Radio Continuum Survey 
(SPARCS) Working Group has therefore been es- 
tablished, under the auspices of the SKA project, 
with the following goals: 

• To coordinate developments of techniques, to 
avoid duplication of effort and ensure that 
each project has access to best practice, 

• To facilitate cross-project discussions of the 
specific science goals, to ensure cross-fertilisation 
of ideas and optimum survey strategies, 

• To coordinate the surveys in their choice of 
area, depth, location on the sky, and other 
survey parameters, to maximise the science 
return from the surveys. 

SPARCS consists of a core group of the lead- 
ers of the radio continuum surveys such as EMU, 
and is also open to the estimated 200 other as- 
tronomers who are engaged in the continuum sur- 
vey projects on SKA Pathfinders. The first meet- 
ing of SPARCS took place in February 2011 at 
the Lorentz Center in Leiden, The Netherlands, 
and was attended by about 60 astronomers. The 
Procee dings will be publish ed as a multi-author 
paper ( Norris et al.||20lfb ) in a refereed journal. 
SPARCS will continue to hold such meetings an- 
nually, as well as facilitating other interactions be- 
tween members of the various survey groups. 



4 EMU Observing and Oper- 
ations 

It is expected that the observations will take place 
commensally with other ASKAP surveys such as 
WALLABY and POSSUM, implying that observ- 
ing strategies and schedules will need to be agreed 
on by the different teams. The actual observing is 



expected to be conducted autonomously, with ob- 
serving schedules delivered to the telescope in ad- 
vance, and telescope performance monitored through- 
out the observations. Little or no interaction is ex- 
pected to take place between the astronomers and 
the telescope until the data products are delivered 
from the pipeline for quality control. 

Observations are expected to take place on one 
field at a time, rather than revisiting each field 
many times, to minimise the effect of source vari- 
ability and to minimise the large processing over- 
heads associated with combining several epochs of 
data. Typically, ASKAP will survey one field of 
30deg 2 for 12 hours before moving to the next, al- 
though this may be complicated by the need for 
dithering to achieve a uniform sensitivity across 
the field. 

4.1 Data Quality Control 

Data produced by the ASKAPsoft processing pipeline 
will be subject to a number of Data Quality checks 
before being released into the public domain, and 
this Quality Control process is the responsibility of 
the EMU team. In real-time, the statistics of each 
field will be compared with those obtained from 
other radio surveys, previous ASKAP pointings, 
and the expected performance of ASKAP, includ- 
ing: 

• rms noise level, 

• source counts of detected sources (i.e. do 
they fit published logN/logS distributions?), 

• spectral indices, 

• the circularly polarised source components 
(expected to be close to zero), 

• rotation measures of linearly polarised source 
components, 

• measured positions of cross-identified com- 
ponents, compared with their positions from 
multi- wavelength surveys. 

In the early days of the survey, these auto- 
mated tests will be followed by a visual inspec- 
tion of each image for artefacts or other problems. 
However, it is expected that as experience and con- 
fidence grows with time, the visual inspections will 
be replaced by automated tests. 

A further level of testing will be provided on 
a longer timescale by the cross-identification pro- 
gram, which will, for example, show up any sys- 
tematic positional errors. Additional quality con- 
trol data will come from the WTF program (sec- 
tion 2.11) which will not only identify astronom- 
ically interesting objects which do not fall into 
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known classes, but will also identify errors in the 
EMU data which may masquerade as a new class 
of objects. 

The final quality control step is from external 
users. As all data is released into the public do- 
main in near-real time, feedback will be encour- 
aged from external users of EMU data which will 
enable the identification of rare or subtle problems 
in the data. 

4.2 Data Release 

The primary data products consist of the uv data, 
image data, and a catalogue of radio components, 
as listed in Table [2j These data products are deliv- 
ered to the ASKAP Science Data Archive Facility 
(ASDAF) by the real-time processing pipeline, and 
are then subjected to the quality control processes 
described above. After passing quality control, the 
data will then be released into the public domain, 
with no proprietary period. Once routine observ- 
ing is established, the time taken for the quality 
control step is expected to be no more than about 
a day, although in the early days of the survey the 
data quality control step is likely to take longer. 

Because the time to process the primary data 
is comparable to the observing time, routine re- 
processing of data is not an option. It is therefore 
important that the primary data products are as 
final as possible, and it is likely that there will be 
only one data release for each field. 

The secondary data products consist of a cat- 
alogue of sources, each of which may consist of 
one or more components in the primary catalogue, 
together with cross-identifications with optical/IR 
data. This will also be produced by an automated 
pipeline which will run asynchronously from the 
primary data processing pipeline. The secondary 
data products will also be placed in the public do- 
main as soon as possible after processing, although 
the need to protect proprietary optical/IR data 
with privileged access is also recognised. 

It is likely that as additional survey data be- 
come available over time, and as expertise increases, 
the cross-identification process will also be refined. 
Several releases of the secondary catalogue are there- 
fore expected, each with a release number embed- 
ded in the source identifiers. 

Both the primary and secondary data will be 
available from the ASDAF using Virtual Observa- 
tory (VO) tools such as Simbad and TopCat, and 
through simple cone-search and cut-out servers. It 
is expected that they will also be available from the 
NASA Extragalactic Database (NED) and other 
astronomical databases. 



4.3 Outreach and Communications 

EMU provides an excellent opportunity to inform 
and engage the public about new developments 
in radio astronomy and ASKAP. An EMU web- 
site aimed at the public is currently under de- 
velopment. Initially it will be a fairly traditional 
"static" site but there is great potential for it to 
grow and incorporate new technologies as data be- 
come available, once ASKAP is o perat ional. Link- 
age with the Radio Zoo site (see ^3.9 ) should ex- 



pose it to a large and diverse audience. 

The advent of a new "Australian Curriculum" 
for both Science (K-10) and Physics (Years 11-12) 
(http://www.australiancurriculum.edu.au) over the 
next few years is an opportunity to develop new, 
targeted formal education resources utilising not 
just EMU data for the "Science Understanding" 
and "Science Inquiry Skills" strands but examples 
of scientists involved in EMU and ASKAP for the 
"Science as a Human Endeavour" strand for stu- 
dents across Australia. 

Communication within the EMU team is pri- 
marily via a closed Wiki, augmented by an elec- 
tronic newsletter produced three times per year. 

5 Conclusion 

The most obvious benefit from EMU is the legacy 
survey. Its greater depth, greater resolution, and 
program of cross-identifications will impact all ar- 
eas of astronomy. EMU's sensitivity means that 
the majority of objects in EMU will be star-forming 
galaxies rather than radio-loud AGNs. Thus EMU 
plays a key role in the continuing emergence of ex- 
tragalactic radio astronomy from the niche that 
it once occupied, focussing largely on radio-loud 
AGNs, to one that impacts on all areas of astron- 
omy. As a legacy survey, EMU will provide a re- 
source to all astronomers, leveraging astronomi- 
cal observations at all wavelengths both through 
the VO server and also through data centres like 
NED and CDS. It is likely to find rare, interest- 
ing targets (such as the ~100 EoR radio galaxies) 
that can then be followed up by other observatories 
such as ALMA, ELT, JWST, and Meerkat, which 
are better suited to deep pointed observations. 

In addition to the legacy program, this paper 
has described a number of science areas in which 
EMU is likely to have a major impact. For exam- 
ple, it will distinguish between alternative mod- 
els of cosmology and General Relativity, trace the 
evolution of star formation and AGNs, and signif- 
icantly increase the number of known clusters of 
galaxies. 

It is even possible that the greatest impact of 
EMU is on science that hasn't yet been thought 
of. EMU will occupy a new region of observational 
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phase space, and we plan to exploit this by min- 
ing the data for unexpected types of astronomical 
object. 

EMU is novel in that cross-identifications with 
all available data at optical, infrared, and radio 
wavelengths will be produced by the EMU cross- 
identification pipeline, and will be placed in the 
public domain, as a service to the community, as 
soon as the data quality has been checked. They 
will be made available using the VO server (and 
thus immediately via services such as Aladin and 
TopCat), through the EMU web page, and also 
through the ASKAP Science Data Archive Facility 
services. 

EMU is an open collaboration and we welcome 
interactions and collaborations with other surveys 
and other instruments, to increase our common 
scientific productivity. We also encourage other in- 
dividual scientists to join the EMU collaboration, 
to help generate even better science, by contacting 
Ray Norris. Further information on EMU can be 
found on http : / / askap.org/emu| 
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